AMENDMENT NO. 1 DECEMBER 2010

TO 

IS 13194 : 1991 INDIAN SCRIPT CODE FOR 

INFORMATION INTERCHANGE — ISCII 

(Page 4, clause 4.10) - Insert the following new clause after 4.10.2:

'4.10.3 Indian Rupee - ₹ is the symbol for Indian Rupee.'

[Page 5, Table 1 and 2, Hex Code (col F and row C), Dec. Code (col 240
and row 12)] - Insert '₹'.

(Page 6, Table 3, row 'FC 252', col 'Char') - Insert '₹'.

(Page 6, Table 3, row 'FC 252', col 'Name') - Substitute 'Indian Rupee
Symbol' for 'This position shall not be used'.

(Page 6, Note 1) - Insert the following at the end:

'except FC which has been assigned to Indian Rupee Symbol'

(Page 9, clause 9) - Insert the following new clause after 9:

'10 THE INDIAN RUPEE SYMBOL ON KEYBOARDS

The Indian Rupee Symbol shall be placed on AltGr+4 key for Inscript keyboard
layout as well as for keyboards of QWERTY form.'

(Page 12, Annex A) - Insert '₹' below the last row of the table under
each column except for column 'RMN'.

[Page 13, Annex B, Table Hex Code (col F and row C), Dec. Code (col
240 and row 12)] - Insert '₹'.

[Page 27, Annex H, Symbols (SYM)] - Substitute '₹' for '?'.
Computer Media Sectional Committee, LTD 37



FOREWORD 



This Indian Standard was adopted by the Bureau of Indian Standards after the draft
finalized by the Computer Media Sectional Committee had been approved by the
Electronics and Telecommunication Division Council.

This standard conforms to IS 10401:1982, "8-bit coded character set for information
interchange" (equivalent to ISO 4873). It is intended for use in all computer and
communication media which allow usage of 7 or 8-bit characters, as per IS 12326:1987 (ISO
2022:1982) "7-bit and 8-bit coded character set - code extension techniques".

In an 8-bit environment, the lower 128 characters are the same as defined in IS 10315:1982
(ISO 646 IRV) "7-bit coded character set for information interchange", also known as the
ASCII character set. The top 128 characters cater to all the 10 Indian scripts based on the
ancient Brahmi script.

In a 7-bit environment the control code SI can be used for invocation of the ISCII code set, 
and control code SO can be used for reselection of the ASCII code set. 

There are 15 officially recognized languages in India: Hindi, Marathi, Sanskrit, Punjabi, 
Gujarati, Oriya, Bengali, Assamese, Telugu, Kannada, Malayalam, Tamil, Urdu, Sindhi 
and Kashmiri. 

Out of these, Urdu, Sindhi and Kashmiri are primarily written in Perso-Arabic scripts, but
get written in Devanagari too (Sindhi is also written in the Gujarati script). Apart from
Perso-Arabic scripts, all the other 10 scripts used for Indian languages have evolved from
the ancient Brahmi script and have a common phonetic structure, making a common
character set possible. The Northern scripts are Devanagari, Punjabi, Gujarati, Oriya, Bengali
and Assamese, while the Southern scripts are Telugu, Kannada, Malayalam and Tamil.

The official language of India, Hindi, is written in the Devanagari script. Devanagari is also 
used for writing Marathi and Sanskrit. It is also the official script of Nepal. 

As Perso-Arabic scripts have a different alphabet, a different standard is envisaged for
them.

An Attribute mechanism has been provided for selection of different Indian script font and 
display attributes. An Extension mechanism allows use of more characters along with the 
ISCII code. These are only meant for the environment where no other alternative selection 
mechanism is available. 

The ISCII code table is a super-set of all the characters required in the ten Brahmi-based
Indian scripts. For convenience, the alphabet of the official script Devanagari (with
diacritic marks for non-Devanagari alphabets) has been used in the standard. For notational
simplicity, elsewhere, the term Indian scripts implies Brahmi-based Indian scripts.

Annex-A provides information on the shapes of the corresponding alphabet of the 10
Indian scripts. Annexes B and C provide information on the adaptation of the ISCII code
for an IBM-PC and "English-Alphabet only" environment. Annex-D defines a suitable
keyboard overlay which is common for all the Indian scripts. Annex-E defines the Attribute
codes used for selection of different scripts and display attributes. Annex-F defines the
Roman script transliteration scheme for all the Indian scripts. Annex-G defines the Vedic
character set available through the Extension code. Annex-H defines the conversion
mechanism between the ISCII code and the earlier ISSCII-83 code used in bilingual telex
machines.
History

Since the 70s, different committees of the Department of Official Languages and the 
Department of Electronics (DOE) have been evolving different codes and keyboards 
which could cater to all the Indian scripts due to their common phonetic structure. Earlier 
efforts could not keep the ASCII code intact. 

In July 1983, DOE announced the ISSCII-83 code which complied with the ISO 8-bit 
code recommendations ("Report of the sub-committee on Standardization of Indian 
Scripts and their Codes for Information Processing", DOE, July 1983). While retaining 
the ASCII character set in the lower half, it provided the Indian script character set in the 
upper 96 characters. This also had the recommendation on a Phonographic based 
keyboard layout for all the Indian scripts. 

A keyboard standard for Indian scripts was brought out by DOE in 1986 (Report of the
committee for "Standardization of Keyboard Layout for Indian Script Based Computers"
in Electronics-Information & Planning, Vol. 14, No. 1, Oct. 1986). The report also
contained the recommendation for the corresponding 8-bit ISCII code.

There was a revision of the ISCII code by DOE in 1988 for making it more compact, in
order to evolve its corresponding IBM-PC counterpart: PC-ISCII (Report of the
sub-committee on "Standardization of Indian Script codes for Information Interchange",
DOE, August 1988).
Indian Standard

INDIAN SCRIPT CODE FOR INFORMATION
INTERCHANGE - ISCII



1. SCOPE 

The ISCII code standard specifies a 7-bit code table which can
be used in a 7 or 8-bit ISO compatible environment. It allows
English and Indian script alphabets to be used simultaneously.

It shall not be used in incompatible environments like that of
the IBM-PC, and with computers which do not allow 8-bit characters,
or which do not follow ISO code extension techniques.

It cannot be used in the 5-bit Baudot code used for telecommunications.
However, transcoding to Baudot is possible as
given in Annex-H.



2. TERMINOLOGY 



2.1 Alphabet/Script Terminology 

2.1.1 Letter: A character representing one or more of the 
simple or compound sounds used in speech. It can be any of the 
alphabetic symbols. 

2.1.2 Conjunct (Ligature): A letter which is a combination of
two or more basic letters. The shape of the conjunct may, or
may not, give a clue to the constituting letters. Example: the joint
form (digraph) of "ae".

2.1.3 Diacritic mark: A mark added to a letter which distinguishes
it from the same letter without a mark, usually having
a different phonetic value or stress.

2.1.4 International numerals: The conventional 0 to 9 digits
used in English for denoting numbers. These are also known as
Indo-Arabic numerals (to differentiate them from the Roman
numerals like IX for 9).

2.1.5 Script numerals: The 0 to 9 digits in a script, which have
shapes distinct from their international counterparts.

2.1.6 Vowel: A letter representing a speech sound made with
the vibration of the vocal cords, but without audible obstruction.
English examples: a, e, i, o, u.

2.1.7 Vowel sign: A graphic character associated with a letter,
to indicate a vowel to be associated with that character (Matra
in Hindi).

2.1.8 Diphthong: A compound vowel character, in which the 
articulation begins as for one vowel and moves onto another. 
Example: as in "coin", "loud" and "side". 

2.1.9 Consonant: A letter representing a speech sound in
which the breath is at least partly obstructed, and which has to
be combined with a vowel to form a syllable.

2.1.10 Pure consonant: A consonant which does not have
any vowel implicitly associated with it. Example: all the English
consonants.

2.1.11 Nasal consonant: A consonant pronounced with the
breath passing through the nose. Example: m, n, ng.

2.1.12 Nasalized vowel: A vowel pronounced with the breath
passing both through the nose and the mouth. Example:
French bon voyage. In Indian scripts this is denoted by a
Chandrabindu diacritic mark.

2.1.13 Aspirated consonant: A consonant which is pro- 
nounced with an extra puff of air coming out at the time of 
release of the oral obstruction. This has a sound of an extra "h". 
Example: The initial "p" sound in English words like "pin" 
(phonetically "phin"). 

2.1.14 Syllable: A unit of pronunciation uttered without inter- 
ruption, forming whole or part of a word, and usually having one 
vowel or diphthong sound optionally surrounded by one or 
more consonants. Example: there are two syllables in "water" 
and three in "inferno". 

2.1.15 Alphabet: A set of letters used in writing a language. Example:
the English alphabet consists of upper and lower-case
letters A to Z.

2.1.16 Basic alphabet: The minimal set of letters which can be
used for uniquely encoding every word of a language. Example:
the basic alphabet for English consists of only the upper-case
letters A to Z.

2.1.17 Phonetic alphabet: An alphabet which has direct correspondence
between letters and sounds. Example: the Indian
scripts.

2.1.18 Latin alphabet: The alphabet used for writing the
language of ancient Rome. Also known as the Roman alphabet.
Used today for writing English and some other European
languages.

2.1.19 Script: A distinctive and complete set of characters
used for the written form of one or more languages.

2.1.20 Roman script: The script based on the ancient Roman
alphabet, with the letters A-Z and additional diacritic marks.
Used for writing a language which is not usually written in the
Roman alphabet.

2.1.21 Romanization: Representation of words of a script
using the Roman alphabet, possibly through additions of
diacritic marks. Example: Romaji is the romanized form of the
Japanese script.

2.1.22 Transliteration: Representation of words with the closest
corresponding letters in an alphabet of a different language.

2.2 Font/Display Terminology 

2.2.1 Font: A set of symbols used for display or printing of a 
script in a particular style. 

2.2.2 Display rendition: The process by which a string of
characters is displayed (or printed). In this process several
consecutive characters may combine with each other on the
screen. The sequence of display of the characters may become
different.

2.2.3 Display composing: The process of organizing the 
basic shapes available in a font in order to display (or print) a 
word. 



2.3 Character/Coding Terminology 

2.3.1 Bit: Binary digit. It can have only two values: 0 and 1.

2.3.2 Byte: A bit string that is operated upon as a unit. It usually
represents a character and usually consists of eight bits.

2.3.3 Hex digit: Hexadecimal digit, where each digit has 16
values. The values above 9 are denoted by the letters A to F
as shown: A(10), B(11), C(12), D(13), E(14), F(15). Four bits
are needed to encode a hex digit.

2.3.4 Character: A symbol which can represent a letter, a 
numeral, a punctuation mark, a special symbol or even a 
control function. 

2.3.5 Control character (control code): A character which 
normally has no visual form, but affects the recording, process- 
ing, transmission or interpretation of data. 

2.3.6 Graphic character: A character, other than a control 
character, that has a visual representation. Normally handwrit- 
ten, printed or displayed. 

2.3.7 5-bit characters (5-bit codes): Characters whose code
has 5 bits, allowing representation of 32 characters.

2.3.8 7-bit characters (7-bit codes): Characters whose code
has 7 bits, allowing representation of 128 characters.

2.3.9 8-bit characters (8-bit codes): Characters whose code
has 8 bits, allowing representation of 256 characters.

2.3.10 Character set: A set of characters grouped together for
a purpose, like that of representing a script.

2.3.11 Code table: A table showing the positions allotted to
individual characters from a character set.

2.3.12 Character code: Position in the code table of the
character.



2.3.13 Code extension: The techniques for encoding of 
characters that are not included in the character set of a given 
code. 

2.3.14 Extended character set: Characters which are not 
present in the main character set, but are available through 
some code extension techniques. 

2.3.15 ASCII code: American Standard Code for Information 
Interchange. A 7-bit code which specifies 32 control characters 
and 96 graphic characters, for English language. 

2.3.16 Transcoding: A set of tables and rules by which a code-table
can be transformed to another code-table, such that the
characters get mapped to their equivalent forms.

2.4 Other Terminology 

2.4.1 Direct sorting: Sorting of words done through direct 
comparison of the corresponding character codes. No special 
heuristics or rules are used. 

2.4.2 Dictionary sorting order: Order in which the letters
should be organized within an alphabet, such that words can
get ordered according to the language dictionaries. Special
rules may have to be applied in addition to direct sorting to
achieve this. Example: in English, upper and lower cases have
to be transformed to a single case before direct sorting is
applied.
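
The difference between 2.4.1 and 2.4.2 can be illustrated with the English example given above. The following is only a minimal sketch in Python; the function names are hypothetical and are not part of this standard.

```python
def direct_sort(words):
    """Direct sorting (2.4.1): compare the character codes as they are."""
    return sorted(words)


def dictionary_sort(words):
    """Dictionary order for English (2.4.2): fold to a single case,
    then apply direct sorting to the folded form."""
    return sorted(words, key=str.lower)


words = ["banana", "Cherry", "apple"]
print(direct_sort(words))      # upper-case 'C' has a lower code than any lower-case letter
print(dictionary_sort(words))  # case is ignored, so 'apple' comes first
```

With direct sorting "Cherry" is placed first only because of its case; the extra case-folding rule restores the order a dictionary would use.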

2.4.3 Default: A value or state which is assumed when no 
value or state is explicitly stated. 

2.4.4 Keyboard overlay: Defines the characters for each key 
position (unshifted, shifted etc.), which are meant to replace the 
standard English characters on a QWERTY keyboard. 



3. ISCII CODE PHILOSOPHY 

A code for all the Indian scripts is made possible by their
common origin from the Brahmi script. An optimal keyboard
overlay for all the Indian scripts is made possible by the
phonetic nature of the alphabet.

There are manifold advantages in having a common code and
keyboard for all the Indian scripts. Any software which allows
ISCII codes to be used can be used in any Indian script,
enhancing its commercial viability. Furthermore, immediate
transliteration between different Indian scripts becomes possible,
just by changing the display modes. Simultaneous availability
of multiple Indian languages in the computer medium will
accelerate their development and facilitate national integration.

The 8-bit ISCII code retains the standard ASCII code, while the
Indian script keyboard overlay is designed for the standard
English QWERTY overlay. This ensures that English can co-exist
with the Indian scripts. This approach also makes it
feasible to use Indian scripts along with existing English computers
and software, so long as 8-bit character codes are allowed.
4. NATURE OF INDIAN ALPHABET

All the Indian scripts have originated from the ancient Brahmi 
script which is phonetic in nature. The alphabet in each may 
vary somewhat, but they all share a common phonetic struc- 
ture. The differences between scripts primarily are in their 
written forms, where different combination rules get used. 



4.2 Anuswar 

Anuswar indicates a nasal consonant sound. When an Anuswar
comes before a consonant belonging to any of the 5 Vargs, then
it represents the nasal consonant belonging to that Varg. Before
a non-Varg consonant, however, the Anuswar represents a
different nasal sound. Some Hindi examples:



4.1 The Consonants 

Indian script consonants have an implicit अ (a) vowel included
in them. They have been categorized according to their phonetic
properties. There are 5 Vargs (Groups) and non-Varg
consonants. Each Varg contains 5 consonants, the last of which
is a nasal one. The first four consonants of each Varg constitute
the Primary and Secondary pair. The second consonant of
each pair is the aspirated counterpart (has an additional "h"
sound) of the first one.





            Primary        Secondary      Nasal

Varg 1      क    ख         ग    घ         ङ
            k    kh        g    gh        ṅ

Varg 2      च    छ         ज    झ         ञ
            c    ch        j    jh        ñ

Varg 3      ट    ठ         ड    ढ         ण
            ṭ    ṭh        ḍ    ḍh        ṇ

Varg 4      त    थ         द    ध         न
            t    th        d    dh        n

Varg 5      प    फ         ब    भ         म
            p    ph        b    bh        m

non-Varg    य    र    ल    व    श    ष    स    ह
            y    r    l    v    ś    ṣ    s    h



Note that the consonants श (ś) and ष (ṣ) are pronounced
identically today.

Apart from these consonants, there are some other consonants 
used in some specific Indian scripts: 



ऩ (ṉ)   Comes instead of न (n) at the middle and end of
        Tamil words, except in the न्त (nt) conjunct.

य़ (ẏ)   Used in Oriya, Bengali and Assamese.
        This is pronounced as "ja", while य gets
        pronounced as "ya".

ऱ (ṟ)   Is an extra trilled "ra" used in Tamil, Telugu and
        Malayalam. In Marathi it is used for forming
        the half-ra as in "<<l-q'r (^TT~'aT ).

ळ (ḷ)   Used in Tamil, Telugu, Kannada,
        Malayalam, Oriya, Gujarati and Marathi.

ऴ (ẓ)   Used in Tamil and Malayalam.



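Anuswar examples (see 4.2):

Varg 1    अंक = अङ्क       पंख = पङ्ख       गंगा = गङ्गा      संघ = सङ्घ
          aṅk              paṅkh            gaṅgā             saṅgh

Varg 2    मंच = मञ्च       पंछी = पञ्छी     पंजा = पञ्जा      सांझ = साञ्झ
          mañc             pañchī           pañjā             sāñjh

Varg 3    घंटा = घण्टा     कंठ = कण्ठ       झंडा = झण्डा      ढूंढ = ढूण्ढ
          ghaṇṭā           kaṇṭh            jhaṇḍā            ḍhūṇḍh

Varg 4    संत = सन्त       पंथ = पन्थ       बंद = बन्द        गंध = गन्ध
          sant             panth            band              gandh

Varg 5    चंपा = चम्पा     गुंफ = गुम्फ     खंबा = खम्बा      स्तंभ = स्तम्भ
          campā            gumph            khambā            stambh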
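
The Anuswar rule of 4.2 can be sketched in code. The following Python fragment is only an illustration: it assumes the code positions of Table-3 (five consecutive consonants per Varg, the last one being the nasal), and the names VARGS and varg_nasal are hypothetical.

```python
# Code positions as listed in Table-3. Each Varg occupies five consecutive
# positions, and the last position of each Varg is its nasal consonant.
VARGS = [
    range(0xB3, 0xB8),  # Varg 1: KA KHA GA GHA NGA
    range(0xB8, 0xBD),  # Varg 2: CHA CHHA JA JHA JNA
    range(0xBD, 0xC2),  # Varg 3: Hard TA, THA, DA, DHA, NA
    range(0xC2, 0xC7),  # Varg 4: Soft TA, THA, DA, DHA, NA
    range(0xC8, 0xCD),  # Varg 5: PA PHA BA BHA MA (C7, the Tamil NA, is not in a Varg)
]


def varg_nasal(consonant):
    """Return the code of the nasal that an Anuswar stands for before
    `consonant`, or None when the consonant is a non-Varg one."""
    for varg in VARGS:
        if consonant in varg:
            return varg[-1]
    return None


print(hex(varg_nasal(0xBD)))  # Hard TA belongs to Varg 3, whose nasal is Hard NA
print(varg_nasal(0xD7))       # SA is a non-Varg consonant
```

For a non-Varg consonant the function returns None, since the text only says that the Anuswar then represents a different nasal sound.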



4.3 Nasalization Sign: Chandrabindu 

The ◌ँ denotes nasalization of the preceding vowel (can be the
implicit अ vowel within a consonant). Example: 3#ff, wn, yn^

In Devanagari script it often gets substituted with Anuswar, as
the latter is more convenient for writing. In some words, however,
Anuswar and Chandrabindu can give different meanings.
Hindi example: हँस (Laugh), हंस (Swan).



4.4 Visarg ः

Comes after a vowel sound, and represents a sound similar to
"h". This also represents the Aytham (ஃ) character in Tamil.



4.5 Vowels and Vowel signs (Matras) 

There are separate symbols for all the vowels in Indian scripts
which are pronounced independently (either at the beginning of
a word, or after a vowel sound). The consonants in the Indian
scripts themselves have an implicit vowel अ (a). To indicate a
vowel sound other than the implicit one, a vowel-sign (Matra) is
attached to the consonant. Thus there are equivalent Matras for
all the vowels, excepting the अ vowel.
Roman          ā     i     ī     u     ū     ṛ     e
Vowel          आ     इ     ई     उ     ऊ     ऋ     ऎ
Matra          ◌ा    ◌ि    ◌ी    ◌ु    ◌ू    ◌ृ    ◌ॆ
Matra on क     का    कि    की    कु    कू    कृ    कॆ

Roman          ē     ai    ê     o     ō     au    ô
Vowel          ए     ऐ     ऍ     ऒ     ओ     औ     ऑ
Matra          ◌े    ◌ै    ◌ॅ    ◌ॊ    ◌ो    ◌ौ    ◌ॉ
Matra on क     के    कै    कॅ    कॊ    को    कौ    कॉ

4.8 Diacritic Mark: Nukta ◌़

The Nukta is used for the ड़ and ढ़ characters in some Northern
scripts. It is also used for deriving 5 other consonants in the
Devanagari and Punjabi scripts, required for Urdu.



The original pronunciation of the vowel ऋ (ṛ) is now lost; it gets
pronounced mostly as "ri" or "ru".

The vowels ऎ and ऒ are used in Southern scripts for denoting
vowels shorter than ए and ओ respectively.

The vowels ऐ (ai) and औ (au) are actually diphthongs, although
in Hindi they also get pronounced as longer vowel forms of ऍ
and ऑ respectively.

Vowels ऍ and ऑ are used in modern Devanagari for representing
the English vowel sounds as in "bat" and "ball" respectively.

Sanskrit infrequently uses three other vowels, which are obsolete
today in other Indian scripts. These are:

Vowels: ॠ ऌ ॡ

Matras: ◌ॄ ◌ॢ ◌ॣ



4.6 Vowel Omission Sign: Halant ◌्

In Indian scripts consonants are assumed to have an implicit
vowel अ "a" within them unless an explicit Matra (vowel-sign) is
attached. Thus a special sign Halant (◌्) is needed for indicating
that the consonant does not have the implicit अ vowel in it.

In Northern languages, the Halant at the end of a word generally
gets dropped, though the ending still gets pronounced without
a vowel. Example: Ashok = अशोक् => अशोक.

This doesn't happen in Southern languages and Sanskrit,
where a Halant is always used to indicate a vowel-less ending.
Example: param = परम् (Sanskrit word).

4.7 Conjuncts 

Indian scripts contain numerous conjuncts, which essentially
are clusters of up to four consonants without the intervening
implicit vowels. The shape of these conjuncts can differ from
those of the constituting consonants. These conjuncts are
formed in the ISCII code by putting the Halant (◌्) character
between the constituent consonants.
Example: «rf5rT = ^~^ TT'Tf" "JT 



Nukta consonants (see 4.8):

क     ख     ग     ज     ड     ढ     फ
k     kh    g     j     ḍ     ḍh    ph

क़     ख़     ग़     ज़     ड़     ढ़     फ़
q     ḵh    g̱h    z     ṛ     ṛh    f



4.9 Punctuation 

All punctuation marks used in Indian scripts are borrowed from
English, except for the full-stop, instead of which a Viram (।) is
used in the Northern scripts. The Viram is, however, being
increasingly substituted by a full-stop. A double Viram (॥) is also
used in Sanskrit texts for indicating a verse ending.



4.10 Other Signs 

4.10.1 Avagrah ऽ is primarily used in Sanskrit texts. It creates
an extra stress on the preceding vowel. Two Avagrahs can be
used for creating further extra stress. Avagrah is not used in
modern Indian scripts.

4.10.2 Om ॐ is a Hindu religious symbol.



4.11 Numerals 

Many Indian scripts today use only the International numerals. 
Even in others, the usage of international numerals instead of 
the original forms is increasing. Although the Devanagari script 
has its own numerals, the official numeral system is the Inter- 
national one. 



5. LAYOUT OF ISCII CODE TABLE

The 8-bit Code for Latin and Indian script alphabets is given in
Table-1. It consists of 256 positions, arranged in 16 rows and
16 columns. The rows are numbered in decimal as 0 to 15, and
in hex as 0 to F. The columns are numbered in decimal as 0 to
240 in increments of 16, and in hex as 0 to F. The lower 128
characters of this table contain the ASCII character set.

The 7-bit Code for Indian script alphabets is given in Table-2.
It is meant for an ISO compatible 7/8 bit environment. It consists
of 94 positions, arranged in 6 columns and 16 rows.

A position in the Code table is identified in decimal as well as
hex notation. A character located at decimal column x and row
y will have its decimal position as x+y. A character located at
hex column x and row y will have its hex position as xy.
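
The position arithmetic can be checked with a short Python sketch. The function names are hypothetical; the sample position (decimal column 240, row 12, i.e. hex FC) is the one named in Amendment No. 1.

```python
def position(column, row):
    """Decimal position for a decimal column (0, 16, ..., 240) and row (0..15)."""
    if column % 16 != 0 or not 0 <= column <= 240 or not 0 <= row <= 15:
        raise ValueError("column must be a multiple of 16 up to 240, row must be 0..15")
    return column + row


def hex_position(column, row):
    """Hex position 'xy': hex column digit followed by hex row digit."""
    position(column, row)  # reuse the range checks
    return f"{column // 16:X}{row:X}"


print(position(240, 12), hex_position(240, 12))  # 252 FC
print(position(160, 1), hex_position(160, 1))    # 161 A1
```

Because each column holds exactly 16 rows, the decimal sum x+y and the two-digit hex form xy always denote the same position.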
Table-1 8-bit Code Table of the Latin and Indian Script Alphabet

Hex column      0    1    2    3    4    5    6    7    8    9    A    B    C    D    E    F
Dec. column     0    16   32   48   64   80   96   112  128  144  160  176  192  208  224  240
Row (Hex Dec.)
0   0           NUL  DLE  SP   0    @    P    `    p                   ओ    ढ    ऱ    ◌ॆ   EXT
1   1           SOH  DC1  !    1    A    Q    a    q              ◌ँ   औ    ण    ल    ◌े   ०
2   2           STX  DC2  "    2    B    R    b    r              ◌ं   ऑ    त    ळ    ◌ै   १
3   3           ETX  DC3  #    3    C    S    c    s              ः    क    थ    ऴ    ◌ॅ   २
4   4           EOT  DC4  $    4    D    T    d    t              अ    ख    द    व    ◌ॊ   ३
5   5           ENQ  NAK  %    5    E    U    e    u              आ    ग    ध    श    ◌ो   ४
6   6           ACK  SYN  &    6    F    V    f    v              इ    घ    न    ष    ◌ौ   ५
7   7           BEL  ETB  '    7    G    W    g    w              ई    ङ    ऩ    स    ◌ॉ   ६
8   8           BS   CAN  (    8    H    X    h    x              उ    च    प    ह    ◌्   ७
9   9           HT   EM   )    9    I    Y    i    y              ऊ    छ    फ    INV  ◌़   ८
A   10          LF   SUB  *    :    J    Z    j    z              ऋ    ज    ब    ◌ा   ।    ९
B   11          VT   ESC  +    ;    K    [    k    {              ऎ    झ    भ    ◌ि
C   12          FF   FS   ,    <    L    \    l    |              ए    ञ    म    ◌ी
D   13          CR   GS   -    =    M    ]    m    }              ऐ    ट    य    ◌ु
E   14          SO   RS   .    >    N    ^    n    ~              ऍ    ठ    य़    ◌ू
F   15          SI   US   /    ?    O    _    o    DEL            ऒ    ड    र    ◌ृ   ATR


Table-2 7-bit Code Table of the Indian Script Alphabet

Hex column      A    B    C    D    E    F
Dec. column     160  176  192  208  224  240
Row (Hex Dec.)
0   0                ओ    ढ    ऱ    ◌ॆ   EXT
1   1           ◌ँ   औ    ण    ल    ◌े   ०
2   2           ◌ं   ऑ    त    ळ    ◌ै   १
3   3           ः    क    थ    ऴ    ◌ॅ   २
4   4           अ    ख    द    व    ◌ॊ   ३
5   5           आ    ग    ध    श    ◌ो   ४
6   6           इ    घ    न    ष    ◌ौ   ५
7   7           ई    ङ    ऩ    स    ◌ॉ   ६
8   8           उ    च    प    ह    ◌्   ७
9   9           ऊ    छ    फ    INV  ◌़   ८
A   10          ऋ    ज    ब    ◌ा   ।    ९
B   11          ऎ    झ    भ    ◌ि
C   12          ए    ञ    म    ◌ी
D   13          ऐ    ट    य    ◌ु
E   14          ऍ    ठ    य़    ◌ू
F   15          ऒ    ड    र    ◌ृ   ATR


Table-3: ISCII Character set - Coded representation

Hex   Dec.  Char  Name
A1    161   ◌ँ    Vowel-modifier CHANDRABINDU
A2    162   ◌ं    Vowel-modifier ANUSWAR
A3    163   ः     Vowel-modifier VISARG
A4    164   अ     Vowel A
A5    165   आ     Vowel AA
A6    166   इ     Vowel I
A7    167   ई     Vowel II
A8    168   उ     Vowel U
A9    169   ऊ     Vowel UU
AA    170   ऋ     Vowel RI
AB    171   ऎ     Vowel E (Southern Scripts)
AC    172   ए     Vowel EY
AD    173   ऐ     Vowel AI
AE    174   ऍ     Vowel AYE (Devanagari Script)
AF    175   ऒ     Vowel O (Southern Scripts)
B0    176   ओ     Vowel OW
B1    177   औ     Vowel AU
B2    178   ऑ     Vowel AWE (Devanagari Script)
B3    179   क     Consonant KA
B4    180   ख     Consonant KHA
B5    181   ग     Consonant GA
B6    182   घ     Consonant GHA
B7    183   ङ     Consonant NGA
B8    184   च     Consonant CHA
B9    185   छ     Consonant CHHA
BA    186   ज     Consonant JA
BB    187   झ     Consonant JHA
BC    188   ञ     Consonant JNA
BD    189   ट     Consonant Hard TA
BE    190   ठ     Consonant Hard THA
BF    191   ड     Consonant Hard DA
C0    192   ढ     Consonant Hard DHA
C1    193   ण     Consonant Hard NA
C2    194   त     Consonant Soft TA
C3    195   थ     Consonant Soft THA
C4    196   द     Consonant Soft DA
C5    197   ध     Consonant Soft DHA
C6    198   न     Consonant Soft NA
C7    199   ऩ     Consonant NA (Tamil)
C8    200   प     Consonant PA
C9    201   फ     Consonant PHA
CA    202   ब     Consonant BA
CB    203   भ     Consonant BHA
CC    204   म     Consonant MA
CD    205   य     Consonant YA
CE    206   य़     Consonant JYA (Bengali, Assamese & Oriya)
CF    207   र     Consonant RA
D0    208   ऱ     Consonant Hard RA (Southern Scripts)
D1    209   ल     Consonant LA
D2    210   ळ     Consonant Hard LA
D3    211   ऴ     Consonant ZHA (Tamil & Malayalam)
D4    212   व     Consonant VA
D5    213   श     Consonant SHA
D6    214   ष     Consonant Hard SHA
D7    215   स     Consonant SA
D8    216   ह     Consonant HA
D9    217   INV   Consonant INVISIBLE
DA    218   ◌ा    Vowel Sign AA
DB    219   ◌ि    Vowel Sign I
DC    220   ◌ी    Vowel Sign II
DD    221   ◌ु    Vowel Sign U
DE    222   ◌ू    Vowel Sign UU
DF    223   ◌ृ    Vowel Sign RI
E0    224   ◌ॆ    Vowel Sign E (Southern Scripts)
E1    225   ◌े    Vowel Sign EY
E2    226   ◌ै    Vowel Sign AI
E3    227   ◌ॅ    Vowel Sign AYE (Devanagari Script)
E4    228   ◌ॊ    Vowel Sign O (Southern Scripts)
E5    229   ◌ो    Vowel Sign OW
E6    230   ◌ौ    Vowel Sign AU
E7    231   ◌ॉ    Vowel Sign AWE (Devanagari Script)
E8    232   ◌्    Vowel Omission Sign (Halant)
E9    233   ◌़    Diacritic Sign (Nukta)
EA    234   ।     Full Stop (Viram, Northern scripts)
EB    235         This position shall not be used
EC    236         This position shall not be used
ED    237         This position shall not be used
EE    238         This position shall not be used
EF    239   ATR   Attribute Code
F0    240   EXT   Extension Code
F1    241   ०     Digit 0
F2    242   १     Digit 1
F3    243   २     Digit 2
F4    244   ३     Digit 3
F5    245   ४     Digit 4
F6    246   ५     Digit 5
F7    247   ६     Digit 6
F8    248   ७     Digit 7
F9    249   ८     Digit 8
FA    250   ९     Digit 9
FB    251         This position shall not be used
FC    252         This position shall not be used
FD    253         This position shall not be used
FE    254         This position shall not be used

Note: 1. The positions EB-EE and FB-FE are reserved for future expansion of the code.
      2. Scripts corresponding to other Indian languages are given in Annex-A.
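
The grouping of Table-3 into ranges can be expressed as a small classifier. This is a sketch only, in Python; the function name and the category labels are hypothetical. It treats FC as the Indian Rupee Symbol, as assigned by Amendment No. 1, and the other positions of EB-EE and FB-FE as reserved.

```python
def classify(code):
    """Return the Table-3 category of an 8-bit code position (0..255)."""
    if not 0 <= code <= 0xFF:
        raise ValueError("code must fit in 8 bits")
    if code < 0x80:
        return "ASCII"
    if 0xA1 <= code <= 0xA3:
        return "vowel modifier"
    if 0xA4 <= code <= 0xB2:
        return "vowel"
    if 0xB3 <= code <= 0xD8:
        return "consonant"
    if code == 0xD9:
        return "invisible consonant"
    if 0xDA <= code <= 0xE7:
        return "vowel sign"
    if code == 0xE8:
        return "halant"
    if code == 0xE9:
        return "nukta"
    if code == 0xEA:
        return "viram"
    if code == 0xEF:
        return "attribute code"
    if code == 0xF0:
        return "extension code"
    if 0xF1 <= code <= 0xFA:
        return "digit"
    if code == 0xFC:
        return "rupee symbol"  # per Amendment No. 1
    if 0xEB <= code <= 0xEE or 0xFB <= code <= 0xFE:
        return "reserved"
    return "unassigned"  # 0x80-0xA0 and 0xFF carry no ISCII character


for sample in (0x41, 0xA4, 0xB3, 0xDA, 0xE8, 0xF5):
    print(f"{sample:02X}", classify(sample))
```

Because vowels, consonants and vowel signs occupy contiguous ranges, simple range checks are enough; no per-character lookup table is needed for this purpose.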
6. STRUCTURE OF THE ISCII CODE



A common alphabet for all the Indian scripts is made possible
by their common origin from the same ancient Brahmi script.
The ISCII code contains only the basic alphabet required by the
Indian scripts. All the composite characters are formed through
combinations of these basic characters.

6.1 Vowels and Matras 

The ISCII code contains separate vowels and Matras (vowel
signs). While a vowel can be used independently, a
Matra is valid only after a consonant. Thus:



6.2 Vowel Modifiers ◌ँ, ◌ं, ः

After a consonant, vowel or Matra character, a character can
be used which modifies the vowel sound and is called a "Vowel
Modifier". This can be a Chandrabindu (◌ँ), Anuswar (◌ं) or
Visarg (ः). Example:



6.3 Halant ◌्

The implicit vowel in a consonant can be removed by addition
of a Halant sign (◌्). In the ISCII code conjuncts are formed by
typing a Halant character between consonants. A conjunct may
consist of up to 4 consonants joined by Halants. Example:



^ ~ tT = tP 


■?r-;T= ?r 


TTT -g = ff 


^ ' 'S ~ T = IS? 


^ ~ 1^ = « 


■fT' T = 31 


■5T~ ^ = -31 


T ~ ■^•~ ^ = ■^ 
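
The rule of 6.3 can be sketched as a byte-level helper. This Python fragment is an illustration under stated assumptions: the code positions come from Table-3, the limit of four consonants comes from the text above, and the name make_conjunct is hypothetical.

```python
HALANT = 0xE8                      # Table-3: Vowel Omission Sign (Halant)
FIRST_CONSONANT, LAST_CONSONANT = 0xB3, 0xD8


def make_conjunct(consonants):
    """Join 2 to 4 consonant codes with Halants and return the ISCII bytes."""
    if not 2 <= len(consonants) <= 4:
        raise ValueError("a conjunct consists of 2 to 4 consonants")
    for code in consonants:
        if not FIRST_CONSONANT <= code <= LAST_CONSONANT:
            raise ValueError(f"{code:#04x} is not a consonant position")
    out = bytearray()
    for index, code in enumerate(consonants):
        if index:                  # a Halant goes between consonants only
            out.append(HALANT)
        out.append(code)
    return bytes(out)


# KA + Halant + Hard SHA
print(make_conjunct([0xB3, 0xD6]).hex())  # b3e8d6
```

Only the stored code sequence is built here; how the conjunct is finally shaped on screen is a matter of display rendition (2.2.2).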



6.3.2 Soft Halant 

A Soft Halant is formed by typing a Nukta character after a
Halant. In Devanagari the Soft Halant allows retention of the
"half form" for the preceding consonant, and prevents it from
combining with the following consonant. Example:









Soft Halant is used in Malayalam along with some consonants
to derive separate pure consonant shapes which do not show
an attached Halant symbol:



em 



T = ait 



m 









T) 



6.4 Invisible Consonant INV 

The INV (Invisible) code is used for formation of composite 
characters which require a consonantal base, but where the 
consonant itself ought to be invisible. These may be required 
only for some special display purposes. Example: 

^- INV =^ INV - T = 7 

T - INV = ' INV - 7 = - 

INV f - = n INV -^ - = -^ 



6.5 The Nukta Character ◌़

The Nukta consonants (क़ ख़ ग़ ज़ ड़ ढ़ फ़) get formed by adding
a Nukta (◌़) character immediately after the appropriate
consonant.

In the ISCII code the same Nukta character is thought of as an
operator to derive some of the lesser used Sanskrit characters
which are not directly available on the Inscript keyboard.

A Nukta can be typed after a Halant to form a Soft Halant.



In practice, a Halant sign is shown only if the consonants do not 
change their shape by joining up. Tamil script has no conjuncts, 
and thus an explicit Halant sign always gets used. Here are 
some Devanagari examples where Halant does not disappear: 



6.3.1 Explicit Halant 

A Halant is used between consonants to form conjuncts. But 
many times in Sanskrit and Vedic texts, one may wish to show 
an Explicit Halant which would be shown on the previous 
consonant, and which would prevent the consonant from 
joining with the next one. Two consecutive Halants form an 
Explicit Halant. Example: 



^- ^ f" = f^ 
z-rf- = fi 
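
The three Halant usages described in 6.3, 6.3.1 and 6.3.2 can be told apart by looking at the byte that follows each Halant. The sketch below is illustrative only: the byte values for Halant and Nukta are assumptions read off the ISCII code table (they are not restated in this clause), and the function name is hypothetical.

```python
HALANT = 0xE8  # assumed value, taken from the ISCII code table
NUKTA = 0xE9   # assumed value, taken from the ISCII code table


def halant_kinds(data: bytes) -> list[str]:
    """Label every Halant usage found in an ISCII byte string."""
    kinds = []
    i = 0
    while i < len(data):
        if data[i] == HALANT:
            nxt = data[i + 1] if i + 1 < len(data) else None
            if nxt == HALANT:
                kinds.append("explicit")  # Halant + Halant
                i += 2
                continue
            if nxt == NUKTA:
                kinds.append("soft")      # Halant + Nukta
                i += 2
                continue
            kinds.append("plain")         # ordinary conjunct-forming Halant
        i += 1
    return kinds
```

A Nukta that directly follows a consonant is not reported, since it forms a Nukta consonant rather than a Soft Halant.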






Table 4: ISCII characters derived by appending a Nukta

Char    Nukta Char    Name
क       क़            Consonant QA (Urdu)
ख       ख़            Consonant KHHA (Urdu)
ग       ग़            Consonant GHHA (Urdu)
ज       ज़            Consonant ZA (Urdu)
ड       ड़            Consonant Flapped DA
ढ       ढ़            Consonant Flapped DHA
फ       फ़            Consonant FA (Urdu)
ऋ       ॠ            Vowel RII (Sanskrit)
ृ       ॄ            Vowel Sign RII (Sanskrit)
इ       ऌ            Vowel LI (Sanskrit)
ि       ॢ            Vowel Sign LI (Sanskrit)
ई       ॡ            Vowel LII (Sanskrit)
ी       ॣ            Vowel Sign LII (Sanskrit)
ँ       ॐ            Sign OM
।       ऽ            Vowel Stress Sign AVAGRAH (Sanskrit)
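
Table 4 amounts to a lookup keyed on the character that precedes a Nukta. A minimal sketch of such a lookup follows. All byte values are assumptions taken from the ISCII code table rather than from this clause, and `name_units` is a hypothetical helper that only names the derived characters, it does not render them.

```python
NUKTA = 0xE9  # assumed byte value from the ISCII code table

# Base byte -> name of the character obtained by appending a Nukta (Table 4).
# Every byte value here is an assumption read off the ISCII code table.
NUKTA_DERIVED = {
    0xB3: "Consonant QA (Urdu)",
    0xB4: "Consonant KHHA (Urdu)",
    0xB5: "Consonant GHHA (Urdu)",
    0xBA: "Consonant ZA (Urdu)",
    0xBF: "Consonant Flapped DA",
    0xC0: "Consonant Flapped DHA",
    0xC9: "Consonant FA (Urdu)",
    0xAA: "Vowel RII (Sanskrit)",
    0xDF: "Vowel Sign RII (Sanskrit)",
    0xA6: "Vowel LI (Sanskrit)",
    0xDB: "Vowel Sign LI (Sanskrit)",
    0xA7: "Vowel LII (Sanskrit)",
    0xDC: "Vowel Sign LII (Sanskrit)",
    0xA1: "Sign OM",
    0xEA: "Vowel Stress Sign AVAGRAH (Sanskrit)",
}


def name_units(data: bytes) -> list[str]:
    """Name Nukta-derived characters; other bytes are shown as hex."""
    units = []
    i = 0
    while i < len(data):
        b = data[i]
        if b in NUKTA_DERIVED and i + 1 < len(data) and data[i + 1] == NUKTA:
            units.append(NUKTA_DERIVED[b])
            i += 2
        else:
            units.append(f"0x{b:02X}")
            i += 1
    return units
```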
6.6 Attribute Code (ATR)

The Attribute code, followed by a displayable ASCII character,
defines a font attribute applicable to the following characters.
This mechanism is meant for use in media where no alternative
font selection mechanism is available. The details are given
in Annex-E.

6.7 Extension Code (EXT) 

The Extension code, followed by an ISCII character, defines a
new character which can combine with the previous ISCII
character. This provision has been primarily made for
supplementing Vedic signs along with the Devanagari text. The
Vedic character details are given in Annex-G.

6.8 Numerals 

In all the Indian scripts the international numerals are being
used increasingly. From the software viewpoint, usage of the
same numerals as given in the ASCII set allows proper
handling of numerals by existing software. For display rendition
purposes, however, it may sometimes be desirable to have
separate Indian script numerals, which are given in the ISCII
table.

The ATR mechanism also allows display rendition of the ASCII 
numerals in an Indian script form. The ISCII numerals should be 
used only when it is not possible to use the ATR mechanism for 
selecting numerals in an Indian script. 
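
Where ISCII numerals do occur in stored text, software that expects ASCII digits can fold them back before processing. The following is a minimal sketch under one assumption: that the ISCII digits 0 to 9 occupy 0xF1 to 0xFA, as read from the ISCII code table. The function name is hypothetical.

```python
# Assumption: ISCII digits 0-9 occupy 0xF1..0xFA in the ISCII code table.
_ISCII_DIGITS = bytes(range(0xF1, 0xFB))
_TO_ASCII = bytes.maketrans(_ISCII_DIGITS, b"0123456789")


def iscii_digits_to_ascii(data: bytes) -> bytes:
    """Replace ISCII numerals with ASCII digits; other bytes pass through."""
    return data.translate(_TO_ASCII)
```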



7. PROPERTIES OF ISCII CODE 



Vowel combinations and consonant combinations would get 
ordered as shown below. 



3t 3T 


3Tt 3n - 


aft" 3^ 3Rf"3Rf 




* W> 


sFt ^ - 


¥?l'^ tT'^ft 


^ 


13 fS 


■^ '^ 


isfl''^ Tgf"^ 


15 


T ? 


^ ?T 


'^' "^ if a 


^ 



As shown in the chart above, in Indian scripts a character
followed by a vowel-modifier comes before the character
without it. This is ensured by keeping the vowel-modifiers at the
beginning of the ISCII code table. The only exception is when
a vowel-modifier comes at the end of a word: since the
comparison is now with the ASCII "space" character (32
decimal), which has a lower value, the vowel-modifier character
cannot come before the space. Though it is possible to devise
another space character having a higher value, it would not be
practical to type it in. However, the fact that a vowel-modifier
comes after a space is intuitive, as a longer word
is expected to come after its shorter counterpart. Example:

atf < 3t^ < 3RWf 
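
The ordering rule above falls out of a plain byte-wise comparison. The sketch below illustrates it with three code values that are assumptions taken from the ISCII code table (vowel A, consonant KA and the Anuswar vowel modifier). It is not part of the standard.

```python
A, KA, ANUSWAR = 0xA4, 0xB3, 0xA2  # assumed values from the ISCII code table


def iscii_sorted(words: list[bytes]) -> list[bytes]:
    """Direct sorting: ordinary byte-wise comparison of ISCII strings."""
    return sorted(words)


words = [
    bytes([A, KA]),           # A + KA
    bytes([A, ANUSWAR, KA]),  # A + Anuswar + KA
    bytes([A]),               # A alone
    bytes([A, ANUSWAR]),      # A + Anuswar
]
ordered = iscii_sorted(words)
# Inside a word the modifier sorts first (0xA2 < 0xB3), so A+Anuswar+KA
# precedes A+KA. At the end of a word the shorter string wins, so A alone
# precedes A+Anuswar, which is the exception described above.
```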

The Nukta consonants are essentially separate consonants,
and thus should get sorted separately. This indeed happens
since Nukta is kept after all the Indian script characters
(excepting Viram, which is a punctuation).

The new vowels ऍ and ऑ have been kept after the long
vowels ऐ and औ respectively, as the new vowels have more
stress. They get substituted by the long vowels in the traditional
text.



7.1 Phonetic Sequence 

The ISCII characters, within a word, are kept in the same order
as they would get pronounced. Example:

As shown in the latter example, the display order may be
different from the phonetic order. Having a spelling according
to the phonetic order allows a name to be typed in the same
way, regardless of the script it has to be displayed in.

7.2 Direct Sorting 

Since there are variations in ordering of a few consonants
between different Indian scripts, it is not possible to achieve 
perfect sorting in all Indian scripts. Special routines would be 
required when some characters like "Nukta" need to be ignored 
for the purpose of sorting. For most purposes, however, the 
direct sorting achieved through the ISCII code should be 
sufficient. 



7.3 Unique Spellings 

By using only the basic characters in ISCII, there is only one
unique way of typing a word. This would not have been possible
if conjuncts had been given separate codes. The
spelling of a word is now the phonetic order of the constituent
basic characters. This provides a unique spelling for each
word, which is not affected by the display rendition.

For obtaining unique spellings, Soft Halant, Explicit Halant,
and INV characters should not be used. These have been
provided only for deriving different display renditions, and are
not needed normally.

The spelling of a word contains all the information necessary
for display composition, which can be automatically done
through display algorithms. It becomes possible to type in a
text without even looking at the display. When the tedium of
composing goes away, on-line authoring becomes possible,
where an author can think out new text while he is typing it.

Unique spellings are essential for making spelling checkers
and dictionaries. They are also essential to facilitate finding of
words in a word-processor, or for information retrieval from a
data-base.
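
A dictionary or search routine that relies on unique spellings may want to flag words containing the display-only codes named above. A minimal sketch, assuming the byte values of Halant, Nukta and INV from the ISCII code table (the function name is hypothetical):

```python
HALANT, NUKTA, INV = 0xE8, 0xE9, 0xD9  # assumed values from the code table


def uses_display_only_codes(word: bytes) -> bool:
    """True if the word contains INV, an Explicit Halant or a Soft Halant."""
    return (
        INV in word
        or bytes([HALANT, HALANT]) in word  # Explicit Halant
        or bytes([HALANT, NUKTA]) in word   # Soft Halant
    )
```

Words for which this returns True fall outside the unique-spelling guarantee and may need separate handling when matching.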
7.4 Display Independence

A word in an Indian script can be displayed in a variety of styles
depending on the conjunct repertoire used. ISCII codes however
allow a complete delinking of the codes from the displayed fonts.

An ISCII syllable can be displayed using a combination of basic
shapes. Different implementations can choose variant techniques
in combination of these basic shapes. The same text can
thus be seen in different font styles by using a different font
composition routine.

The Inscript keyboard overlay has one-to-one correspondence
with the ISCII code. This way, typing of a word does not depend
upon its displayed form.



7.5 Transliteration 

The ISCII codes are rendered on the display device according to 
the display composition methodology of the selected script. 
Transliteration to another script can thus be obtained by merely 
redisplaying the same text in a different script. 

Since the display rendering process can be very flexible, it is
possible to transliterate the Indian scripts to the Roman script,
using diacritic marks. Similarly it is possible to transliterate them
to other scripts such as Perso-Arabic.
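
The idea that transliteration is only a change of rendering table can be sketched as follows. This is an illustration, not the standard's display algorithm: the three ISCII byte values (KA, MA, LA) are assumptions taken from the code table, the glyph tables are deliberately tiny, and Unicode characters stand in for the display shapes.

```python
# Hypothetical per-script glyph tables for three ISCII codes
# (0xB3 KA, 0xCC MA, 0xD1 LA; values assumed from the ISCII code table).
GLYPHS = {
    "DEV": {0xB3: "\u0915", 0xCC: "\u092E", 0xD1: "\u0932"},
    "BNG": {0xB3: "\u0995", 0xCC: "\u09AE", 0xD1: "\u09B2"},
    "TML": {0xB3: "\u0B95", 0xCC: "\u0BAE", 0xD1: "\u0BB2"},
}


def render(word: bytes, script: str) -> str:
    """Render the same ISCII bytes with the glyph table of one script."""
    table = GLYPHS[script]
    return "".join(table[b] for b in word)


word = bytes([0xB3, 0xCC, 0xD1])  # KA MA LA, stored once
devanagari = render(word, "DEV")
bengali = render(word, "BNG")
```

The stored bytes never change; only the table chosen at display time does.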

Transliteration involves mere change of the script, in a manner 
that pronunciation is not affected. This is not the same as 
"translation" where the language itself changes. 



8. ISCII CODE SYNTAX 

In ISCII code some logically related sub-sets can be identified 
through simple range comparisons. Using these it is possible to 
predict a syllable boundary for an Indian script word. This may be 
necessary for composing fonts for display purposes, or for 
hyphenation at a syllable boundary. 

Consonants (C)

Vowels (V)             अ आ इ ई उ ऊ ऋ ऎ ए ऐ ऍ ऒ ओ औ ऑ

Matras (M)             ा ि ी ु ू ृ ॆ े ै ॅ ॊ ो ौ ॉ

Vowel modifiers (D)    ँ ं ः
Halant (H)             ्
Nukta (N)              ़



8.1 Indian Script Word Syntax 

An Indian script word contains one or more syllables, the syntax
for which is given in the following Backus-Naur Formalism (BNF).

Word                ::= {Syllable} [Cons-Syllable]
Syllable            ::= Cons-Vowel-Syllable | Vowel-Syllable
Vowel-Syllable      ::= V [D]
Cons-Vowel-Syllable ::= [Cons-Syllable] Full-Cons [M] [D]
Cons-Syllable       ::= [Pure-Cons] [Pure-Cons] Pure-Cons
Pure-Cons           ::= Full-Cons H
Full-Cons           ::= C [N]

Following conventions are used in the syntax given above:

::=   defines a relation.
{ }   enclose items which may be repeated one or more times.
[ ]   enclose items which may not be present.
|     separates items, out of which only one can be present.

8.2 Order within a Syllable 

A worst case consonant syllable can contain: 
CNHCNHCNHCNMD 

A worst case vowel syllable can contain: 
VD 



Note: 



• Nukta (N) can come after only the consonants with 
which it can combine. 

• The above syntax ignores the vowels derived through
Nukta (ॠ, ऌ and ॡ) and the Avagrah sign ऽ.

• The INV character not mentioned here is treated as 
consonant. 

• The Halant + Halant ( Explicit Halant) and Halant + 
Nukta (Soft Halant) combinations have been ignored in 
the above discussion. 
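
The syntax of 8.1 and the range comparisons mentioned in clause 8 can be combined into a small syllable splitter. The sketch below is one possible reading, not a normative algorithm: the byte ranges are assumptions read off the ISCII code table, the trailing Cons-Syllable of a word is treated as optional, INV is classed as a consonant as the note above says, and the combinations the notes list as ignored are not handled.

```python
import re


def iscii_class(b: int) -> str:
    """Map an ISCII byte to its syntax class (ranges assumed from the table)."""
    if 0xA1 <= b <= 0xA3:
        return "D"  # vowel modifiers
    if 0xA4 <= b <= 0xB2:
        return "V"  # vowels
    if 0xB3 <= b <= 0xD9:
        return "C"  # consonants, with INV (0xD9) treated as a consonant
    if 0xDA <= b <= 0xE7:
        return "M"  # Matras
    if b == 0xE8:
        return "H"  # Halant
    if b == 0xE9:
        return "N"  # Nukta
    return "?"      # anything else is outside the word syntax


# Word-final Cons-Syllable | Cons-Vowel-Syllable | Vowel-Syllable
_SYLLABLE = re.compile(r"(?:CN?H){1,3}$|(?:CN?H){0,3}CN?M?D?|VD?")


def split_syllables(word: bytes) -> list[bytes]:
    """Split an ISCII word into syllables following the BNF of 8.1."""
    classes = "".join(iscii_class(b) for b in word)
    syllables = []
    pos = 0
    while pos < len(classes):
        m = _SYLLABLE.match(classes, pos)
        if m is None:
            raise ValueError(f"byte at offset {pos} does not fit the syntax")
        syllables.append(word[pos:m.end()])
        pos = m.end()
    return syllables
```

For example, the worst case consonant syllable CNHCNHCNHCNMD comes back as a single 13-byte syllable, while a stray Matra at the start of a word raises `ValueError`.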
ANNEX - A
INDIAN SCRIPT ALPHABET CORRESPONDENCE

Following mnemonics are used for Indian scripts:

DEV: Devanagari    PNJ: Punjabi    GJR: Gujarati
ORI: Oriya         BNG: Bengali    ASM: Assamese
TLG: Telugu        KND: Kannada    MLM: Malayalam
TML: Tamil         RMN: Roman

The Roman script transliteration scheme is explained in Annex F.





[Table: alphabet correspondence chart, one row per ISCII character, with columns RMN, DEV, PNJ, GJR, ORI, BNG, ASM, TLG, KND, MLM, TML]



Note : T ^ is used in Devanagari for representing the half i from - as in w=qT.
Nukta consonants shown in Devanagari are used only in Hindi.
ANNEX-B
PC-ISCII CODE 



[Table: PC-ISCII code chart. Columns: 128, 144, 160, 176, 192, 208, 224, 240 (decimal); rows: 0 to F (hex), 0 to 15 (decimal). Marked positions include ATR, INV and EXT.]



The PC-ISCII code is the version of the ISCII code defined by
the Centre for Development of Advanced Computing (CDAC),
Pune, for compatibility with the IBM-PC 8-bit character set. The
IBM-PC does not follow the ISO 8-bit code recommendation. It uses
a line-drawing character set located between B0 hex and DF hex.
Since these line-drawing characters have to co-exist along with
ASCII and Indian scripts, the PC-ISCII code is designed to
avoid a clash with them. This has been possible through a
bifurcation of the ISCII character set into two halves.

The Indian script numerals defined at the end of the ISCII code
table are not included in the PC-ISCII code set. With PC-ISCII
only the ASCII numerals should be used. These numerals
themselves can be rendered in shapes of numerals in a
particular script through an appropriate Attribute (ATR)
character.



Although the characters are at different locations in the
PC-ISCII code, their sequence remains identical to that in the ISCII
code. This allows the PC-ISCII code to be functionally identical
to the ISCII code, enabling the same sorting sequence.

The positions occupied by the ATR and EXT codes were left
undefined in the beginning, as some IBM-PC compatibles did
not allow the corresponding characters to be typed in through
the keyboard. When this problem was overcome the PC-ISCII
code was already in wide use, and could not be changed. These
positions could not be allotted to new characters, as the
sorting order would have been affected. ATR and EXT codes (on
which sorting is not defined) were therefore suitable to fill
these two positions.

The five empty character positions towards the end of the code
are reserved in ISCII, but are needed in other script codes (like
the Perso-Arabic code).






ANNEX-C
ENGLISH-ALPHABET ISCII CODE: EA-ISCII



EA-ISCII is meant for those computers and packages which do
not allow use of 8-bit codes, or ISO compatible 7-bit codes.
Here the Indian script characters have to be defined within the
ASCII character set. By defining the Indian script alphabet in
place of only the 52 upper and lower case English alphabet, one
can ensure that the Indian scripts would be usable wherever
the English alphabet can be used.

Since all the ISCII characters cannot be accommodated
directly, the Nukta character is used to derive some of the lesser
used ones. Only the vowel अ is given directly; others are
represented as अ followed by the corresponding Matra.

Since the same codes are now used to represent both the
English and Indian script alphabet, some way of distinguishing
between both is needed. A small "x" (equivalent to the Nukta
character) at the beginning of a word indicates that the word is
an Indian script one. Very few words start with "x" in English.
If necessary they can be written by doubling the "x" at the start.
A stand-alone "x" continues to be displayed normally. Similarly
"x" after an English word gets displayed normally.

This scheme will work with all English software allowing both
upper and lower cases, but will not work with software which
allows usage of only one of the cases. Apart from having the
same equivalent character set as ISCII, EA-ISCII also gives the
same sorting order. The EA-ISCII codes can be generated by
the Inscript keyboard. It is possible to even generate the Nukta
automatically before the beginning of an Indian script word.

EA-ISCII has provision for the Attribute (ATR) character, but does
not have the Extension (EXT) character. Absence of the EXT
character prevents use of Vedic characters along with
EA-ISCII.

EA-ISCII does not cater to Indian script numerals as
included in the ISCII code. However the ATR character can allow
rendition of the ASCII numerals in the selected script form.

EA-ISCII CODE CHART: The English upper and lower case
alphabet are interpreted as the corresponding Indian script
characters when an "x" is present at the beginning of the word.
The characters shown towards the right of a column are obtained
by appending the Nukta code to the corresponding Indian script
character shown in the middle of the column. Vowels other than
अ are obtained by appending the corresponding Matra to अ.





[Table: EA-ISCII code chart covering the ASCII range SP (32) to DEL (127); rows 0 to F (hex), 0 to 15 (decimal); marked positions include INV and ATR.]
ANNEX-D
INSCRIPT KEYBOARD



The Inscript (Indian Script) keyboard overlay was standardized
by DOE in 1986. ("Report of the Committee for Standardization
of Keyboard Layout for Indian Script Based Computers",
Electronics - Information & Planning Journal, Vol. 14, No. 1,
October 1986).

A revision was done in 1988 by a DOE committee, when it was
decided to compact the ISCII code by deriving some characters
using a separate Nukta character. This required substitution of
the Nukta character in place of the earlier "Transform" key.
From frequency considerations it became necessary to mutually
adjust the positions of some of the vowels, along with their
Matras.

The Inscript overlay can be used on any QWERTY keyboard.

The Indian script legends should be shown on the right-hand
side of a key, as the left-hand side has the English legends. The
Inscript overlay gets selected when Caps-Lock is active;
otherwise the normal lower case English overlay gets selected. It is
possible to use the ALT+CAPS-LOCK key to toggle the Caps-Lock
functionality between this usage and the normal one (where
capital English letters get selected).

Temporary selection of the other overlay can be achieved by
pressing the key along with the RIGHT ALT key (in the IBM
Enhanced keyboard) or the SYS-REQ key (in the PC-AT 84-key
keyboard). This can be very convenient for embedding a single
character from the other overlay.

The Inscript overlay contains characters required for all the
Indian scripts, as defined by the ISCII character set. The Indian
script alphabet has a logical structure, derived from the
phonetic properties. The Inscript overlay mirrors this logical
structure. The overlay has also been optimized from phonetic/
frequency considerations. It is divided into two parts: the vowel
pad on the left hand side, and the consonant pad on the right
hand side.



Within the vowel pad the vowels are given in the shift positions
of the corresponding Matras. All the five main short vowels are
given in the home row while their longer counterparts are
located on the corresponding keys just above them. Since the
vowel अ does not have a corresponding Matra, the अ
vowel-omission sign, Halant, is given in the unshifted position. Halant
is used for forming conjuncts, when it is typed in between
consonants.



J '.oi ? ns 



J* 



^I'e'-natti hanj dCTior '.;eT<- 'i,^f^ i V>^'"r 

'H-iH-'it isty,-('jvi ' :)'" -•10 ''■-'"\ t . *'" -> ls 

'^."C.JtS ^hiliP VP .; ^•-'1? <*" f< ' ^'■ 

^ypsidorabiy -.Ot'vJ > ^u\i >\'' '" i ' '-s'^ 



temdiring cmj ,t. .•■'•- :f tfv o t j v 
ceit'sfion^ an. i.r.-.- ' ^quo \ "-; 



All the characters needed for touch typing are contained in the
bottom 3 rows. The top row contains some conjuncts meant for
ease in sight typing. The conjunct character keys actually send
out the corresponding basic characters.

Due to the phonetic/alphabetic nature of the keyboard, a
person who knows typing in one Indian script can type in any
other Indian script. The logical structure allows ease in learning,
while the frequency considerations allow speed in touch typing.
The keyboard remains optimal both from touch-typing and
sight-typing points of view, in all Indian scripts.
ENGLISH KEYBOARD WITH INSCRIPT OVERLAY

The ASCII characters of a standard QWERTY keyboard are on the left half of a key. The Inscript (Indian Script) 
overlay characters are shown on the right half of a key. CAPS LOCK is used to select the Inscript overlay. 



NUKTA CHARACTERS IN INSCRIPT OVERLAY



When Nukta " . " is typed after a character, the character shown to its left on the key, is obtained. 



INSCRIPT OVERLAY FOR ASSAMESE

Notes ; - Nukta : typed after *5 and 15 gives ? and ? respectively. 

- The macro-keys in the top-row generate: 
(Rakar) 3= 7^ (Reph) " =^ 7 
INSCRIPT OVERLAY FOR TAMIL

Notes : - y|5 is got by typing a%cr fromlhe top row followed by " 
- Conjunct 9^ can also be typed as s'o^ 



INSCRIPT OVERLAY FOR DEVANAGARI

Notes: - 1 is used in Marathi, before ~ to derive haif-ra - (as in w^) 

- ^ is used in M^athi 

-Nukta" can be typed after ^^n^T^^S" i 
toget ^ ^ TT ^ ^ ^ Ts -aSi s 

- The Macro-keys in the top-row generate: 

(Rakar) T = " T (Reph) "^ = T - 

?T = ^-^ =r=cT-T ^ = ^-1? s^^-^-j 



INSCRIPT OVERLAY FOR ORIYA

Notes : - Nukta t typed after and Q. gives © and Q respectively. 
- The Macro -keys in the top - row generate : 

( Rakar) ^ = , Q ( Reph ) '= 6> ^ 
INSCRIPT OVERLAY FOR KANNADA



Notes : - The Macro-keys in the top-row generate : 



(Rakar) ^=-^d 



1=^^^ 



sJ=-E!-^d 



INSCRIPT OVERLAY FOR BENGALI



Notes 



Nukta . typed after'® and 15 gives ^ and ^ respectively. 
The macro-keys in the top-row generate: 
(Rakar) -= 7<l (Reph) " =^ 7 



5r = «f 



INSCRIPT OVERLAY FOR TELUGU



Notes : - The Macro-keys in the top-row generate : 

(Rakar) C= ^^ 



^ = 4^6 
INSCRIPT OVERLAY FOR MALAYALAM

Notes : - The Macro-keys in the top-row generate : 

(Rakar) L = " <'> <^^ = «^""*' 

- 5) is formed by typing f> "<^ 

- Alternate forms of some half characters are obtained by typing in a Nukta T as shown; 



INSCRIPT OVERLAY FOR GUJARATI



Notes : - The Macro-keys in the top-row generate : 

(Rakar) ^ = ^ 1 (Reph) ' = i^ 

a='VO(. 5l=rt^ i!l=iH 



«i = «i i 



INSCRIPT OVERLAY FOR PUNJABI



Notes : - 



Rakar ^ 


= Z ^ 
















Nukta : 


can be typed after 


cT 


if 


^ 


fl 


^ 


^ 


¥ 




to get 


cT 


y 


3T 


H 


3 


5 


?^ 
ANNEX-E
ATTRIBUTE CODES 



An ASCII character which follows the ATR character indicates 
a new font Attribute which is applicable for the subsequent 
characters till the end of the row, or till another attribute code is 
encountered. 

The ASCII character, following the ATR character, can indicate 
94 different attributes. Out of these the first 31 attributes are re- 
served for display attributes, while the rest of 63 attributes 
indicate selection of a font for a new script. 



Hex    2     3     4     5     6     7

0            BLD   DEF
1            ITA   RMN               ARB
2            UL    DEV               PRS
3            EXP   BNG               URD
4            HLT   TML               SND
5            OTL   TLG               KSM
6            SHD   ASM               PST
7            TOP   ORI
8            LOW   KND
9            DBL   MLM
A                  GJR
B                  PNJ
C
D
E
F

Columns 2 and 3: ATR (display attribute) codes. Columns 4 to 7: FONT codes
(columns 4 to 6 Normal direction, column 7 Reverse direction).



E-1 Display Attributes (21h to 3Fh) 

An Attribute code indicates a new display attribute, which is
effective till the end of a line, or till the same attribute is
encountered again. At the beginning of a line all the display
attributes are supposed to be off; each subsequent occurrence of a
display attribute toggles that attribute on the display. Different
attributes can combine together to give composite attributes.
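
The toggling rule can be sketched in a few lines of Python. This is only an illustration, not part of the standard: the value used for the ATR character (EFh) is an assumption, since this annex does not state it, and the attribute codes 30h to 39h are taken from the attribute table above. The function name `render_attrs` is hypothetical.

```python
ATR = 0xEF  # assumed code of the ATR character; not stated in this annex

# Display attribute codes as read from the attribute table (30h to 39h).
DISPLAY_ATTRS = {
    0x30: "BLD", 0x31: "ITA", 0x32: "UL", 0x33: "EXP", 0x34: "HLT",
    0x35: "OTL", 0x36: "SHD", 0x37: "TOP", 0x38: "LOW", 0x39: "DBL",
}

def render_attrs(row: bytes):
    """Return (byte, active attributes) for every text byte of one row."""
    active = set()              # all display attributes are off at row start
    out = []
    i = 0
    while i < len(row):
        b = row[i]
        if b == ATR and i + 1 < len(row):
            name = DISPLAY_ATTRS.get(row[i + 1])
            if name is not None:
                active ^= {name}    # a second occurrence switches it off
            i += 2                  # other attribute codes are skipped here
            continue
        out.append((b, frozenset(active)))
        i += 1
    return out
```

For example, a row containing `A`, ATR BLD, `B`, ATR ITA, `C`, ATR BLD, `D` yields `A` plain, `B` bold, `C` bold and italic, and `D` italic only.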



Basic Attributes are:
Highlight Bold Italics Underline Outline Expanded

These can combine together to give different effects:

Highlight+Bold = ExtraBold

Expanded characters are of Double width. 

Double Height characters can be achieved by duplicating the
word on two consecutive lines and then using the TOP attribute
on the top line and LOW attribute on the bottom line. Since TOP
and LOW attributes also work on toggle basis, it is possible to
have a mixture of double height and single height characters
within the same row.

It is possible to create variety of effects using all these attrib- 
utes: 



[Figure: sample text rendered with combined attributes, including Double Height]



The DBL, Double size row attribute makes the whole row 
double-height and double-width. This can be used along with 
TOP and LOW attributes to get quadruple size characters. 

E-2 Font Attributes (40h to 7Eh) 

At the beginning of a row the default display script is assumed 
to be active. The font attributes cause selection of a new script 
till the end of a row, or till another font attribute is encountered. 

E-2.1 DEF (Default Font) 

The Default font attribute causes re-selection of the default 
display script. 

E-2.2 RMN (Roman Font) 

A Roman script corresponding to a particular non-English
script is rendered using only the English alphabet along with
suitable diacritic marks. The Roman script transliteration is
useful for making legible a script not known by a person. A
family of scripts (like the Indian scripts, or Perso-Arabic scripts)
will have a common set of diacritic marks.
The RMN font attribute selects Roman script corresponding to
the currently active script. The numerals after the RMN attribute 
will be shown as international numerals. 

E-2.3 Indian Script Fonts (42h to 4Bh) 

This selects a Brahmi based Indian script. The subsequent 
numerals will be shown in the forms corresponding to the script, 
if they exist, otherwise they will be shown in their international 
form. 



E-2.4 Perso-Arabic Fonts (71h to 76h)

These scripts are written from right to left. In general, codes from
71h to 7Eh are reserved for scripts written in the reverse
direction. The Perso-Arabic family contains Arabic (ARB),
Persian (PRS), Urdu (URD), Sindhi (SND), Kashmiri (KSM)
and Pushto (PST). Amongst these, Urdu, Sindhi and Kashmiri,
belonging to the Indian sub-continent, have considerable
similarity.

The ASCII numerals will be shown in the Perso-Arabic form,
after a Perso-Arabic font attribute.
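
As a rough illustration of the code ranges described in this annex, the following Python sketch classifies the ASCII character that follows ATR. The function name `classify_attribute` and the returned labels are hypothetical; only the range boundaries (21h to 3Fh, 40h to 7Eh, and 71h to 7Eh for reverse direction) come from the text.

```python
# Font mnemonics as laid out in the attribute table of this annex.
FONT_NAMES = {
    0x40: "DEF", 0x41: "RMN", 0x42: "DEV", 0x43: "BNG", 0x44: "TML",
    0x45: "TLG", 0x46: "ASM", 0x47: "ORI", 0x48: "KND", 0x49: "MLM",
    0x4A: "GJR", 0x4B: "PNJ",
    0x71: "ARB", 0x72: "PRS", 0x73: "URD", 0x74: "SND", 0x75: "KSM",
    0x76: "PST",
}

def classify_attribute(code: int) -> str:
    """Classify the character following ATR by its code range."""
    if 0x21 <= code <= 0x3F:
        return "display"                 # 31 display attribute codes
    if 0x40 <= code <= 0x70:
        return "font"                    # left-to-right script fonts
    if 0x71 <= code <= 0x7E:
        return "font-reverse"            # right-to-left script fonts
    return "invalid"
```

The three ranges together cover the 94 printable ASCII codes 21h to 7Eh: 31 display codes plus 63 font codes.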
ANNEX-F
ROMAN SCRIPT TRANSLITERATION 



The National Library at Calcutta standardized the diacritic
marks to be used for romanization of Indian scripts, in 1988
("The National Library Newsletter", June 1988).

As Northern scripts do not have short ऎ and ऒ, the long ए and
ओ can also be rendered without diacritic mark as 'e' and 'o'
respectively.

Unlike Sanskrit and Southern scripts, in the Northern scripts the
implicit vowel "a" at end of a word is not pronounced, and thus
should be left out in the transliteration. Example: अशोक = aśok,
रमण = ramaṇ. This also applies for nasal conjuncts where a Varg
consonant is preceded by a Nasal consonant belonging to the
same Varg. Example: बन्ध = bandh, कम्प = kamp. Words ending in
other conjuncts still retain the implicit "a" vowel. Example: पुत्र =
putra, राष्ट्र = rāṣṭra, मिश्र = miśra.

VOWELS 



5R3Trf^ ^^^^ 


c? 


^ 


a a i i u u r r 


1 


I 


^ XI ^ ^ 3tt 3ft 3?t 


3Tf 




e e ai e o 6 au 


6 





CONSONANTS 





The Five Vargs 




■^ 


f3 


•n 


^ 


T- 


k 


kh 


g 


gh 


h 


^ 


^ 


^ 


¥ 


^ 


c 


ch 


J 


jh 


n 


Z 


■3 


T 


S 


^ 


t 


th 


d 


dh 


n 


TT 


«T 


^ 


«r 


■^ -T 


t 


th 


d 


dh 


n n 


^ 


■qr 


^ 


«T 


■*T 


P 


ph 


b 


bh 


m 



Non-Vargs 



^ 


IT 


T 


X 


^ 


oJ 


y 


y 


r 


I 


I 


1 


aS 


"^ 


■?T 


^ 


^ 


¥ 


Z 


V 


s 


s 


s 


h 



Nukta Consonants 



■¥ 


w 


?T 


^ 


T 


^ 


TF 


q 


kh 


gh 


z 


d 


dh 


f 



VOWEL MODIFIERS 



Chandrabindu ~ 


m 


Visarg T 


h 


Anuswar 




before'^ varg 


h 


before "^ varg 


n 


before ■? varg 


n 


before 'tT varg 


n 


before ^ varg 


m 


otherwise


rn 
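
The Anuswar rule above depends on the Varg of the consonant that follows. A minimal Python sketch of that lookup is given here. The diacritics on the Roman letters are not legible in this copy, so the forms used below (ṅ, ñ, ṇ, n, m, and ṃ otherwise) are an assumption based on common romanization practice, and the function name `romanize_anusvara` is hypothetical.

```python
# The five Vargs (Devanagari) and the nasal assumed for each of them.
VARG_NASAL = {
    "कखगघङ": "ṅ",
    "चछजझञ": "ñ",
    "टठडढण": "ṇ",
    "तथदधन": "n",
    "पफबभम": "m",
}

def romanize_anusvara(next_char):
    """Return the Roman letter for an Anuswar, given the following character.

    next_char is the character after the Anuswar, or None at end of word.
    """
    if next_char:
        for letters, nasal in VARG_NASAL.items():
            if next_char in letters:
                return nasal
    return "ṃ"    # assumed fallback form for the "otherwise" case
```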



Notes : 



W, ^, c5 are used only in Sanskrit 

^ = short ^ in Southern scripts 

3lT = short 3ff in Southern scripts 

^ = new vowel in Devanagari, as in "bat" 

an = new vowel in Devanagari, as in "ball" 

^ n = an in Tamil 

^ y = ^ in Bengali and Oriya, while "^ y = ^. 

T r = Tamil( jp ), Telugu( as ), & Malayalam( o )

3C, r = - in Marathi 

aS I used in Marathi 

■3 ! =Tamil(6tr), Malayalam( s ), Telugu( '^ ) & 

Kannada(^) 
off 2 = Tamil( jp ), Malayalam( f )
ANNEX-G
EXTENDED CHARACTER SET FOR VEDIC 



The ISCII codes for Devanagari catered to all the characters
required for typing Hindi, Marathi and Sanskrit. However they
could not contain the additional characters required for
representing ancient Vedic text. Many of these Vedic characters
combine with other Devanagari characters. The Vedic characters
thus cannot be thought of as constituting an independent
script, but have to be catered to as an extension to the ISCII
character set.

ISCII code provides an Extension code (EXT) which redefines 
the following ISCII character as another character not present 
in the ISCII code. Through this extension technique it is 
possible to represent, apart from Vedic, miscellaneous charac- 
ters required for other Indian scripts. 

It is not sufficient to provide only the extended character set for 
Vedic, it is necessary to make sure that they can be typed in a 
simple way. This necessitates a Vedic Keyboard overlay along 
with the Inscript overlay. While remaining in the Inscript key- 
board overlay it should be possible to select single characters 
from the Vedic overlay. This has been made possible through 
an Extension key, which when pressed along with an ISCII 
character, gives the corresponding character in the Vedic 
overlay. 

The Vedic characters are ignored in transliteration to another 
Indian script and to the Roman script. 

G-1 Nature of the Vedic Characters 

In ancient times Vedic text was characterized by a tonal or pitch
accent: one syllable in a word was pronounced with a different
musical pitch than the others. In Sanskrit the pitch accent was
kept alive for some centuries after Panini, after which it finally
disappeared in pronunciation.

Vedic text gives importance to correct representation of tonal
marks along with the vowels. These tonal marks are called
Svaras. The three main Svaras are Udatta, Anudatta and
Svarita. Out of these Udatta has generally no indication.

Different Vedic texts however have no consistency in usage of
diacritic marks to indicate these Svaras. The same symbol may
get used by different authorities for different purposes.

In the Vedas there are three lengths for a vowel. These are 
short, long and extra-long (ह्रस्व, दीर्घ, प्लुत). The short and long
vowels are denoted by the normal vowel signs used in Deva- 
nagari, while the extra-long vowel is indicated by putting a ) sign 
after a short or long vowel sign. Example: arts^sunl TRT. 



G-1.1 Udatta (उदात्त)

The vowel that is perceived as having a high tone is called
Udatta, or acutely accented. It is normally not marked. ".." is
used in Shukla Yajurveda texts, at the end of a sentence.



G-1.2 Anudatta (अनुदात्त) ( _ )

The vowel that is perceived as having a low tone is called
Anudatta, or gravely accented. In writing it is marked by a line
underneath the vowel. It also denotes Udatta in Shatapatha
Brahmana. In Kathaka text, Anudatta is shown as a vertical line
below the character (J. 



G-1.3 Svarita (स्वरित) ( ' )

The vowel that is a combination of Udatta and Anudatta tones
is said to be Svarita or circumflexly accented. It is pronounced
by combined raising and falling of the voice. It is marked by a
vertical line above a character.

This definition applies in Rigveda. However the same sign is
defined as Udatta in Yajurveda's Maitrayaniya text.



G-1.4 Long Svarita (दीर्घ स्वरित) ( " )

Long Svarita is denoted by two vertical lines above a character 
( "), in some Taittiriya texts. 



G-1.5 Kampa (कम्प) ( '_ )

Kampa is vibration in the voice while reciting . The Kampa arises 
only when a Svarita is followed immediately by an Udatta (or 
Svarita). In such situations, a low-pitched Matra is inserted 
between them. The recitation sequence consists of high pitch 
of Svarita, followed by low pitch of Kampa, and then the high 
pitch of Udatta or Svarita, which causes the vibration. The 
Matra which is inserted between the Svarita and Udatta (or 
Svarita) is indicated by figure ^ if the Svarita is short and by 
figure 5 if it is long. The figures are marked as both Svarita and 
Anudatta, i.e. { and v 



G-1.6 Jihvamuliya (जिह्वामूलीय or उपध्मानीय) ( x )

This is like a half-Visarga sound, and can come only before four
consonants. Before क and ख it is called Jihvamuliya, while
before प and फ it is called Upadhmaniya.

Example: 3lf*^: + ^31^ = sqfr^xyi^Rl 
G-1.7 Puspika (3f&T^ or «infshd?l) ( «! )

This symbol is just a substitute for the spaces between words, 
and hence is not needed. 



G-1.8 Yajurvedic Anusvara

Anusvara followed by non-Varga characters is pronounced 
with a T^ element mixed with a ^ vowel. This is also known as 
Guriiiar (^pFR). 

- «x is used in Shukla Yajurveda.

- y is used in Krishna Yajurveda. This is also used in Shukla
Yajurveda.

- ■* is an elongated Krishna Yajurvedic Anusvara. 

- &, if, es, J, if, %, & are the variants of Yajurvedic Anusvaras 

found in different editions and seem to be redundant. 



G-1.9 Visarga

There are many variants of the normal Visarga ( t ) found in 
the Vedic texts like - <? (after Udatta), > (after Svarita) ,■=■,<;, and 
%. All of these seem to be redundant. 



G-1.10 Others:

- J" Atharvavedic Jatya Svarita is attached after a character. In
some texts, it is shown preceding a Visarga. But it is better to
show it after Visarga, since that is the logical order.

- Svaras in Samaveda are marked by ?,^ ,^, ^, T and 
3 placed above the characters. These Svaras, when required, 
can be placed in the corresponding positions of the previous 
row. As such no special symbols are needed for them. 



G-2 Extension Codes for Vedic 

The Extension character (EXT), defined in ISCII, can be
followed by another character in the range of A1h to EEh, to yield
additional characters.



Code          ISCII
Hex   Dec.    Char.   Name

A1    161     ँ       Visarga1
A2    162     ं       Filler
A3    163     ः       Yajurvedic Anusvara1
A4    164     अ       Yajurvedic Anusvara2
A5    165     आ       Shukla Yajurvedic Anusvara
A6    166     इ       Yajurvedic Anusvara3
A7    167     ई       Yajurvedic Anusvara4
A8    168     उ       Visarga2
A9    169     ऊ       Visarga3
AA    170     ऋ       Visarga4
AB    171     ऎ       Visarga5
AC    172     ए       Jihvamuliya
AD    173     ऐ       Krishna Yajurvedic Anusvara
AE    174     ऍ       Krishna Yajurvedic Long Anusvara
AF    175     ऒ       Yajurvedic Anusvara5
B0    176     ओ       Yajurvedic Anusvara6
B1    177     औ       Yajurvedic Anusvara7
B2    178     ऑ       Short Kampa
B3    179     क       Long Kampa
B4    180     ख       Jatya Svarita-Atharvaveda
B5    181     ग       Svarita
B6    182     घ       Long Svarita
B7    183     ङ       Kampa
B8    184     च       Anudatta
B9    185     छ       Jatya Svarita (Shukla Yajurveda)
BA    186     ज       Jatya Svarita (Maitrayaniya)
BB    187     झ       Sentence ending Udatta
BC    188     ञ       Jatya Svarita (Non-Taittiriya Yajurveda)
BD    189     ट       Svarita (Maitrayaniya)
BE    190     ठ       Anudatta (Kathaka)

MISCELLANEOUS

BF    191     ड       Abbreviation sign

[The Vedic Char. column of this table is not legible in this copy.]
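
The extension mechanism can be illustrated with a short Python sketch that scans a byte string and pairs each EXT character with the byte that follows it. The value F0h for EXT is an assumption, since this annex does not give it, and the function name `decode_ext` is hypothetical. Only a few entries of the table are included, using the decimal column to confirm the hex codes.

```python
EXT = 0xF0  # assumed code of the Extension character; not stated here

# A few entries of the Vedic extension table (code -> name).
VEDIC_NAMES = {
    0xA1: "Visarga1",
    0xA2: "Filler",
    0xB2: "Short Kampa",
    0xB3: "Long Kampa",
    0xB5: "Svarita",
    0xB8: "Anudatta",
    0xBF: "Abbreviation sign",
}

def decode_ext(data: bytes):
    """Split data into plain ISCII bytes and EXT pairs."""
    out = []
    i = 0
    while i < len(data):
        b = data[i]
        # EXT must be followed by a byte in the range A1h to EEh.
        if b == EXT and i + 1 < len(data) and 0xA1 <= data[i + 1] <= 0xEE:
            code = data[i + 1]
            out.append(("vedic", code, VEDIC_NAMES.get(code, "unassigned")))
            i += 2
        else:
            out.append(("iscii", b, None))
            i += 1
    return out
```

Because every Vedic character is stored as two bytes, deleting one takes two backspaces, as noted in G-4.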



G-3 Structure of Vedic Characters 

The Svarita symbols go above, below or after a character. Only
one symbol can be attached at a time. The character can be a
Full Consonant, Pure Consonant (with Halant), Vowel or
another stand-alone Vedic symbol.
G-3.1 Side Svara ^r

Will go to the extreme right of the character, even after the 
Visarga. Example: 'TJ' "g/ f*T:r 

G-3.2 Top Svara % ^

These will attach at the top of a character. However if there is 
already some other Matra, Anusvara or Chandrabindu sign 
present on the top, then the top-Svara will attach to the right of 
it. Example: 



[Example glyphs not legible in this copy]

G-3.4 Bottom Svara: 7 ^ ^ ^ F ^ ¥ 

These will attach below the character. However if there is 
already a Matra below it, then they may combine with it or will 
get placed after it. 

The Svaras _ , ^ , „ , . and^ combine with the Matras ~ ,~ , 
~ as well as the Halant sign ~ . 



[Example glyphs not legible in this copy]


The Svaras and , do not combine with the Matras, but are 
placed after them. 

[Example glyphs not legible in this copy]



G-4 Keyboard Overlay for Vedic 



A character on the Vedic overlay is selected by pressing it along
with the Extension key. The Extension key can vary in location
depending on the type of keyboard and implementation. On an
84-key IBM PC-AT keyboard, the Extension key can be the
same as the SYS-REQ key. On the 101-key IBM Enhanced
Keyboard, the right-ALT key can be used as an Extension key.
The right-ALT key, being much closer, is more convenient than
the SYS-REQ key; this makes the keyboard more convenient
for speedy typing of Vedic text.

The Extension key should be thought of as another kind of 
SHIFT key, which has to be pressed along with a character key. 
It is effective only when the Inscript overlay is active (CAPS- 
LOCK is on). Each key typed along with the Extension key 
emits a character pair, which join up on the display to show the 
desired character. The first character is the EXT character, 
while the second is a Devanagari character. Thus two back- 
spaces would be required for deleting both these characters. 
Example: 

EXT-KEY + H-key = EXT + T => ?? 

EXT-KEY + T-key = EXT + ^ => " 

Vedic Keyboard Overlay

[Keyboard layout figure; rows labelled VISARGA and ANUSVARA, regions labelled SVARA and MAIN SYMBOLS]



The Vedic overlay has been designed to provide ease in typing
of Vedic symbols. Thus symbols used often are located on
more convenient positions. All the Svaras (tones) are located in
the left-half of the keyboard, to be typed by the left hand. These
Svaras attach on the characters typed before them.

The १ and ३ symbols are provided on the corresponding English
numeral keys. These should be used for indicating short
and extra long vowel sounds rather than the normal numerals.

Main Vedic symbols are provided in the right half of the overlay.
The Svara characters can be attached on these. The important
Visarga and Anusvara symbols are given in separate rows.

Important exceptions are the three symbols located in bottom
right of the overlay. The abbreviation sign (॰) is given for
traditional usage: in modern usage it gets replaced by the full-stop
sign. The Avagrah (ऽ) and Om (ॐ) symbols are given for
convenience only; these could have been typed on the
Devanagari overlay using the Nukta (़) key.

In Vedic texts, one may wish to show an Explicit Halant which
would be shown on the previous consonant, and which would
prevent the consonant from joining with the next one. The
Explicit Halant can be formed by typing Halant twice.
G-4.1 Typing Order

The Svara symbols which attach above, below or after a
character, should be typed at the end of a composite character.

Example: [glyphs not legible in this copy]

Vedic non-Svara characters can take only a Vedic Svara on it.

Example: [glyphs not legible in this copy]

G-5 Vedic Syllable Syntax

Svara (S)

Non-Svara (R)

All the Vedic characters excluding the Svara.

Following is the extension to the ISCII code syntax, required for
Vedic syllables.

Vedic-Syllable ::= Vedic-Cons-Vowel-Syllable |
                   Vedic-Vowel-Syllable | Full-Vedic-Syllable

Vedic-Cons-Vowel-Syllable ::= Cons-Vowel-Syllable S

Vedic-Vowel-Syllable ::= Vowel-Syllable S

Full-Vedic-Syllable ::= R S
ANNEX-H
ISCII IN TELEX/TELEPRINTERS



The Department of Telecommunication (DOT) has adopted the
ISSCII-83 (Indian Script Standard Code for Information
Interchange) code, a DOE 1983 standard, for use in Roman/
Devanagari telex/teleprinters. An 8-bit ISSCII-83 character is
transmitted as two 5-bit characters. These machines initially
interact in Roman using 5-bit Baudot code (CCITT Alphabet No.
2). A protocol is defined, by which machines at both ends can
enter and exit the ISSCII-83 mode.

Hindustan Teleprinter Limited (HTL) introduced the first
Roman/Devanagari telex machine in 1987, which uses the
ISSCII-83 code. It uses the Phono-graphic keyboard
standardized by DOE in 1983.

Due to the large installed base of the telex/teleprinter
machines, it is not desirable to upgrade them to the new ISCII
codes and the Inscript keyboard. For compatibility it would be
necessary for the new machines to continue using the
ISSCII-83 code for communicating with the old machines. The new
machines can fall into two categories.

1. Those which are compatible to the old bilingual machines
and use the ISSCII-83 code along with the phono-graphic
keyboard.

2. Multilingual machines, which communicate to the old
machines using ISSCII-83 code. They will use ISCII code for
interaction with other multilingual machines. All the Indian
scripts are provided through the common Inscript keyboard
overlay. In addition they can have alternate overlays
for specific scripts as approved by the Department of
Telecommunication.



H-1 ISSCII-83 Syntax

The ISSCII-83 characters used in bilingual telex machine can 
be classified as: 

Consonants (C)
Matras (M) 

T ft ; I T " " T>> 

SIGNS (S) 



Symbols (SYM) 

33> ^ ^. 5 

Link (LNK)

A Devanagari word can consist of one or more syllables. Syntax
for a syllable is given in the following Backus-Naur Formalism
(BNF).

Syllable ::= Vowel-Syllable | Cons-Syllable | SYM

Vowel-Syllable ::= LNK M [S]

Cons-Syllable ::= [C LNK] [C LNK] [C LNK] C [M] [S] [S]
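
The grammar is small enough to check with a regular expression. The Python sketch below is an illustration only: it assumes the characters have already been classified into the classes C, M, S, SYM and LNK, and it reads the consonant production as [C LNK] [C LNK] [C LNK] C [M] [S] [S]. The names `is_syllable` and `CLASS_CODE` are hypothetical.

```python
import re

# One letter per character class so that a syllable becomes a short string.
CLASS_CODE = {"C": "C", "M": "M", "S": "S", "SYM": "Y", "LNK": "L"}

# Vowel-Syllable | Cons-Syllable | SYM
SYLLABLE = re.compile(r"LMS?|(?:CL){0,3}CM?S{0,2}|Y")

def is_syllable(classes):
    """Return True if the sequence of class names forms one valid syllable."""
    encoded = "".join(CLASS_CODE[c] for c in classes)
    return SYLLABLE.fullmatch(encoded) is not None
```

For instance, `["C", "LNK", "C", "M", "S"]` is accepted as a consonant syllable, while a bare Matra `["M"]` is rejected because a vowel syllable must start with LNK.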
Table -1 : ISSCII-83 Table of the Latin and Indian Script alphabet





Hex 
Dec. 





1 


2 


3 


4 


5 


6 


7 


8 


9 


A 


B 


c 


D 


E 


F 


Hex 





16 


32 


48 


64 


80 


96 


112 


128 


144 


160 


176 


192 


208 


224 


240 








NUL 


DLE 


SP 





@ 


P 




P 






33. 


^ 


^ 


•qr 


- 


- 


1 


1 


SOH 


DC1 


! 


1 


A 


Q 


a 


q 






- 


^ 


^ 


^ 


T 




2 


2 


STX 


DC2 




2 


B 


R 


b 


r 






- 


'^ 


^ 


«r 


f 


LNK 


3 


3 


ETX 


DC3 


# 


3 


C 


8 


c 


s 






T 


^ 


s 


1 


^ 


^ 


4 


4 


EOT 


DC4 


Rs 


4 


D 


T 


d 


t 






DSP 


3n 


T 


^ 


.J 


^. 


5 


5 


ENQ 


NAK 


% 


5 


E 


U 


e 


u 






5 


3tt 


^ 


T 


<K 


J 


6 


6 


ACK 


SYN 


& 


6 


F 


V 


f 


V 






3f 


3it 


s 


.T 


- 


«? 


7 


7 


BEL 


ETB 


■ 


7 


G 


W 


g 


w 






31T 


^ 


^ 


?T 


i. 


^ 


8 


8 


BS 


CAN 


( 


8 


H 


X 


h 


X 






? 


■^ 


T 


•35 


^ 


^ 


9 


9 


HT 


EM 


) 


9 


1 


Y 


i 


y 






i 


""I 


TT 


°? 


^ 


?T 


A 


10 


LF 


SUB 


* 




J 


Z 


i 


z 






^ 


■fJ 


«T 


■g 


^ 




B 


11 


VT 


ESC 


+ 


; 


K 


I 


k 


{ 






^ 


^ 


^ 


U 


A 




C 


12 


FF 


FS 


, 


< 


L 


\ 


1 


1 






m 


^ 


^ 


^ 


T 




D 


13 


CR 


GS 


- 


= 


M 


] 


m 


} 






^ 


'^I 


^ 


■^ 


^ 




E 


14 


SO 


RS 




> 


N 


A 


n 


~ 






^ 


^ 


■?r 


^ 


> 




F 


15 


SI 


US 


/ 


? 


O 


- 





DEL 






?S 


^ 


T 





y 
CONVERSION FROM ISSCII-83 TO ISCII



AO 


3jo 


A _ 


A1,E9 


A1 


*_ 


_•_ 


A1 


A2 


— 


— 


A2 


A3 


- 


— 


A3 


A4 


DSP 


SP 


20 


A5 


5 


1 - 


EA,E9 


A6 


3T 


3T 


A4 


A7 


aRT 


m 


A5 


A8 


J 


J 


A6 


A9 


i 


i 


A7 


AA 


^ 


7 


A8 


AB 


■^ 


^ 


A9 


AC 


IT 


^ 


AA 


AD 


Tfl 


WT 


AA,E9 


AE 


^ 


? T 


A6,E9 


AF 


^ 


i - 


A7,E9 



DO 


■<ir 


^ 


09 


D1 


'^ 


^ 


CA 


D2 


^ 


«T 


CB 


D3 


T 


T 


CC 


D4 


^ 


^ 


CD 


D5 


T 


T 


CF 


D6 


T 


T 


DO 


D7 


H 


H 


D1 


D8 


qS 


^ 


D2 


D9 


off 


? 


D3 


DA 


^ 


^ 


D4 


DB 


^ 


7T 


D5 


DC 


^ 


^ 


D6 


DD 


^ 


^ 


D7 


DE 


¥ 


^ 


D8 


DF 





T~T 


D0,E8.D0 



BO 


1 




AE 


B1 


^ 


^ 


AB 


B2 


^ 


^ 


AC 


B3 


^ 


^ 


AD 


B4 


3ff 


3rf 


B2 


B5 


3lt 


3ft 


AF 


B6 


3it 


# 


BO 


B7 


3?r 


# 


B1 


Be 


^ 


■^ 


B3 


B9 


^ 


^ 


B4 


BA 


n 


T 


B5 


BB 


^ 


TET 


B6 


BC 


TS 


•5 


B7 


BD 


^ 


^ 


B8 


BE 


■9 


ir 


B9 


BF 


•3T 


^ 


BA 



EC 


- 


- 


E9 


El 


T 


T 


DA 


E2 


f 


f 


DB 


E3 


^ 


■\ 


DC 


E4 


o 


o 


DD 


E5 


•« 


«k 


DE 


E6 


c 


c 


DF 


E7 


t 


« - 


DF,E9 


E8 




— 


E3 


E9 


^ 


^ 


EO 


EA 


:x. 


^ 


El 


EB 


^ 


^ 


E2 


EC 


T 


T 


E7 


ED 


> 


1 


E4 


EE 


> 


> 


E5 


EF 


> 


> 


E6 



*1 



CO 


F 


F 


BB 


CI 


^ 


ir 


BC 


C2 


z 


z 


BD 


C3 


H 


s 


BE 


C4 


T 


•5 


BF 


C5 


T 


T- 


BF,E9 


C6 


^ 


« 


CO 


C7 


^ 


"5" 


C0,E9 


C8 


^ 


nr 


CI 


C9 


rr 


w 


C2 


CA 


«f 


«T 


C3 


CB 


^ 


^ 


C4 


CC 


^ 


^ 


C5 


CD 


^ 


"^ 


C6 


CE 


•T 


^ 


C7 


CF 


^ 


7 


C8 



FO 


•N 


— 


E8.E8 


F1 








F2 


LNK 


-s 


E8 


F3 


"^ 


^T 


C4,CF 


F4 


T. 


^3 • 


CF,DD,2E 


F5 


a 


T - 


CF,E8 


F6 


«fr 


^-^ 


B3,E8,D6 


F7 


■% 


7T-T 


C2,E8.CF 


F8 


¥ 


•3T--3r 


BA.E8,BC 


F9 


«r 


^~^ 


D5,E8,CF 


FA 






ignored 


FB 






Ignored 


FC 






Ignored 


FD 






Ignored 


FE 






Ignored 


FF 






Ignored 



Notes: It is necessary to collect a whole ISSCII-83 syllable before converting it to ISCII.
*1 ISCII Nukta should always be kept immediately after the preceding consonant.
*2 When LINK is followed by a Matra, conversion should be done according to the LINK TABLE.
*3 The Reph sign of ISSCII-83 should be placed as र ् (Ra followed by Halant) of ISCII, at the beginning of the syllable.

LINK TABLE (For conversion of ISSCII-83 LINK followed by a Matra to an ISCII Vowel)



E1 


T 


3IT 


A5 




E6 


c 


^ 


AA 




EB 


^ 


^ 


AD 


E2 


f 


T 


A6 




E7 


T 


w 


AA.EO 




EC 


1 


3ff 


B2 


E3 


> 


f 


A7 




E8 


- 


^ 


AE 




ED 


1 


# 


AF 


E4 


'<^ 


? 


A8 




E9 


.^ 


^ 


AB 




EE 


> 


# 


BO 


E5 


•^ 


gj 


A9 




EA 


i. 


^ 


AC 




EF 


> 


^ 


B1 
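
A table-driven converter along the lines of the notes above could look like the following Python sketch. It is deliberately incomplete: only a handful of entries that are clearly legible in the conversion table and the LINK TABLE are included, the Nukta and Reph reordering of notes *1 and *3 is not implemented, and the names `convert_syllable`, `DIRECT` and `LINK_TABLE` are hypothetical.

```python
LNK = 0xF2  # ISSCII-83 LINK code, as listed in the conversion table

# ISSCII-83 code -> ISCII byte sequence (illustrative subset only).
DIRECT = {
    0xA1: (0xA1,),
    0xA2: (0xA2,),
    0xA3: (0xA3,),
    0xA4: (0x20,),          # DSP becomes a space
    0xA6: (0xA4,),
    0xAD: (0xAA, 0xE9),     # one ISSCII-83 code can expand to two ISCII codes
    0xF2: (0xE8,),          # a LINK not followed by a Matra becomes Halant
}

# LINK followed by a Matra -> ISCII vowel (illustrative subset only).
LINK_TABLE = {0xE1: 0xA5, 0xE2: 0xA6, 0xE3: 0xA7}

def convert_syllable(syllable: bytes) -> bytes:
    """Convert one collected ISSCII-83 syllable to ISCII."""
    out = bytearray()
    i = 0
    while i < len(syllable):
        b = syllable[i]
        nxt = syllable[i + 1] if i + 1 < len(syllable) else None
        if b == LNK and nxt in LINK_TABLE:
            out.append(LINK_TABLE[nxt])     # note *2: use the LINK TABLE
            i += 2
        elif b in DIRECT:
            out.extend(DIRECT[b])
            i += 1
        elif b < 0x80:
            out.append(b)                   # the ASCII half is unchanged
            i += 1
        else:
            raise KeyError(f"no mapping in this sketch for {b:#04x}")
    return bytes(out)
```

The LINK check comes before the direct lookup so that LINK plus Matra is treated as a vowel rather than as a Halant followed by a stray Matra.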
CONVERSION OF ISCII TO ISSCII-83



AO 








A1 


— 


— 


A1 


A2 


-1. 


■j~ 


A2 


A3 


— 


— 


A3 


A4 


3!T 


3T 


A6 


A5 


37T 


LNKT 


F2.E1 


A6 


? 


LNKf 


F2,E2 


A7 


t 


LNK"^ 


F2,E3 


A8 


■5 


LNK- 


F2,£4 


A3 


T, 


LNK- 


F2,E5 


AA 


W 


LNK7 


F2,E6 


AB 


^ 


lnk"" 


F2,E9 


AC 


K 


LNK^ 


F2,EA 


AD 


\ 


LNK^ 


F2,EB 


AE 


^ 


LNK- 


F2,E8 


AF 


^ 


LNK1 


F2,ED 



BO 


# 


LNK> 


F2,EE 


B1 


4 


LNK> 


F2,EF 


B2 


3qf 


LNKT 


F2,EC 


B3 


^ 


^ 


88 


B4 


f§ 


fS 


B9 


B5 


n 


TT 


BA 


B6 


^ 


^ 


BB 


B7 


^ 


^ 


BC 


B8 


^ 


^ 


BD 


89 


a 


9 


BE 


BA 


^ 


^ 


BF 


BB 


"^ 


11 


CO 


BC 


■3T 


of 


CI 


BD 


z 


■J 


C2 


BE 


s 


B 


C3 


BF 


^ 


^ 


C4 



CO 


^ 


^ 


C6 


C1 


"T 


UT 


C8 


C2 


?r 


?r 


C9 


C3 


«T 


ST 


CA 


C4 


^ 


^ 


CB 


C5 


«r 


«r 


CC 


C6 


■^ 


^ 


CD 


C7 


■=T 


■^ 


CD 


C8 


^ 


^ 


CF 


C9 


^ 


^ 


DO 


CA 


"^ 


■^ 


D1 


CB 


^ 


^ 


D2 


CC 


TT 


"IT 


D3 


CD 


■?T 


"JT 


D4 


CE 


IT 


•ZT 


D4 


CF 


T 


T 


D5 



DO 

D1 

D2 

D3 

D4 

D5 

D6 

D7 

D8 

D9 

DA 

DB 

DC 

DD 

DE 

DF 



J 


T 


H 


c^ 


3 


oS 


o? 


oS 


^ 


^ 


^ 


^ 


T? 


TT 


^ 


^ 


^ 


F 


!NV 




T 


T 


f" 


f 


t 


t 


~J 


^ 


- 


- 



D5 

D7 

D8 

D8 

DA 

DB 

DC 

DD 

DE 

Ignored 

E1 

E2 

E3 

E4 

E5 

E6 



EO 


ia. 


b- 


E9 


E1 


1- 


:»- 


EA 


E2 


5. 


4- 


EB 


E3 


^ 


^ 


E8 


E4 


> 


> 


ED 


E5 


> 


> 


EE 


E6 


> 


> 


EF 


E7 


f 


f 


EC 


E8 


— 


LNK 


F2 


E9 


- 


— 


EO 


EA 


1 




2E 


E8 








EC 








ED 








EE 








EF 









FO 








F1 


o 





30 


F2 


I 


1 


31 


F3 


^ 


2 


32 


F4 


^ 


3 


33 


F5 


V 


4 


34 


F6 


H 


5 


35 


F7 


^ 


6 


36 


F8 


V9 


7 


37 


F9 


c 


8 


38 


FA 


% 


9 


39 


FB 








FC 








FD 








FF 








FF 









CONJUNCT TABLE 



A1,E9 


- - 




32, AO 


EA,E9 


1 ~ 




J A5 


GF,E8 


^ 7 




- F5 


B3,E8,D6 


^ ~ 


^ 


^ F6 


C2,E8.CF 


cT - 


T 


^ F7 


BA,E8,BC 


^ - 


■ST 


IT F8 


D5.E8,CF 


?T ~ 


J 


af F9 ■ 



Notes: It is necessary to collect a whole ISCII syllable before translating
it to its corresponding ISSCII-83 syllable.

*1 A double Halant of ISCII gets converted to a single Halant of
ISSCII-83. A single Halant before a non-consonant gets converted to
a single Halant of ISSCII-83, followed by the non-consonant.

*2 If Nukta comes before Halant, ignore the Nukta. If the Nukta
comes before a Matra, then send the Nukta after the syllable.

*3 र ् (Ra followed by Halant) of ISCII detected at the beginning of an
ISCII syllable has to be put after the last consonant of the syllable.
But if the Matra is present, it has to be put immediately after it.






Formatting 5-bit coded bytes from 8-bit coded characters:

1. 8-bit ISCII or ISSCII-83 code for a character

   | b7 | b6 | b5 | b4 | b3 | b2 | b1 | b0 |

2. Splitting the code into two nibbles

   | b7 | b6 | b5 | b4 |     | b3 | b2 | b1 | b0 |

3. Adding the flag bits

   | 1 | b7 | b6 | b5 | b4 |     | 0 | b3 | b2 | b1 | b0 |

4. Adding start & stop bits

   | stop | 1 | b7 | b6 | b5 | b4 | start |

   | stop | 0 | b3 | b2 | b1 | b0 | start |

5. Formation of two sequential 7.5 bit bytes

   | stop | 1 | b7 | b6 | b5 | b4 | start | stop | 0 | b3 | b2 | b1 | b0 | start |

   Second-byte  -->                          First-byte
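
Steps 1 to 3 amount to a nibble split with a flag bit. The Python sketch below illustrates this under two assumptions: the flag is taken as the most significant of the five bits, as drawn, and no claim is made about which unit goes on the line first or about start and stop bit timing. The function names are hypothetical.

```python
def split_to_5bit(code: int):
    """Split an 8-bit code into (high_unit, low_unit), each 5 bits wide.

    high_unit carries flag 1 and bits b7..b4; low_unit carries flag 0
    and bits b3..b0.
    """
    if not 0 <= code <= 0xFF:
        raise ValueError("code must fit in 8 bits")
    high_unit = 0b10000 | (code >> 4)
    low_unit = code & 0x0F
    return high_unit, low_unit

def join_from_5bit(high_unit: int, low_unit: int) -> int:
    """Rebuild the 8-bit code, checking the flag bits of both units."""
    if not (high_unit & 0x10) or (low_unit & 0x10):
        raise ValueError("flag bits do not mark a high/low pair")
    return ((high_unit & 0x0F) << 4) | (low_unit & 0x0F)
```

The flag bit lets a receiver tell the two halves apart, so a lost unit can be detected instead of silently pairing nibbles from different characters.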



Protocol for change over from CCITT to ISSCII-83

BILINGUAL                                              BILINGUAL / MULTILINGUAL
INITIATOR                                              RESPONDING MACHINE

FROM CCITT to ISSCII-83

Indicator Flashes      Send "HHHH" in CCITT     -->
                  <--  Send "DDDD" in CCITT            Indicator Flashes
                                                       ISSCII-83 Mode
Indicator On           Send "OK" Acknowledge    -->    Indicator On
ISSCII-83 Mode         in ISSCII-83

FROM ISSCII-83 to CCITT

Indicator Off          Send two Avagrah: ऽऽ     -->    Indicator Off
CCITT Mode                                             CCITT Mode
                  <--  WRU in CCITT
                       Answer Back in CCITT     -->

                       CALL ESTABLISHED IN CCITT



Protocol for change over from CCITT to ISSCII-83

MULTILINGUAL                                           BILINGUAL
INITIATOR                                              RESPONDING MACHINE

FROM CCITT to ISSCII-83

Indicator Flashes      Send "ZHHHH" in CCITT    -->
                  <--  Send "DDDD" in CCITT            Indicator Flashes
                                                       ISSCII-83 Mode
Indicator On           Send "OK" Acknowledge    -->    Indicator On
ISSCII-83 Mode         in ISSCII-83

FROM ISSCII-83 to CCITT

Indicator Off          Send two Avagrah: ऽऽ     -->    Indicator Off
CCITT Mode                                             CCITT Mode
                  <--  WRU in CCITT
                       Answer Back in CCITT     -->

                       CALL ESTABLISHED IN CCITT


Protocol for change over from CCITT to ISCII

MULTILINGUAL                                           MULTILINGUAL
INITIATING MACHINE                                     RESPONDING MACHINE

FROM CCITT to ISCII

Indicator Flashes      Send "ZHHHH" in CCITT    -->
                  <--  Send "IIII" in CCITT            Indicator Flashes
                                                       ISCII Mode
Indicator On           Send 3 Script-Mnemonic   -->    Indicator On
ISCII Mode             characters in ISCII

FROM ISCII to CCITT

Indicator Off          Send "3?t^3?f" in ISCII  -->    Indicator Off
CCITT Mode                                             CCITT Mode
                  <--  WRU in CCITT
                       Answer Back in CCITT     -->

                       CALL ESTABLISHED IN CCITT
H-2 Bilingual to Bilingual/Multilingual Protocol

H-2.1 After the call is established wit:i verification of the called 
party identity by WRU exchange, the initiating machine sends 
"HHHH" mode-change sequence to the responding-machine 
and an indicator flashes. 

H-2.2 On receiving "HHHH" sequence, the responding-ma- 
chine sends "DDDD" identification sequence in CCITT code 
and its modes changes to lSSCII-83. Its indicator starts flash- 
ing. 

H-2.3 If the initiating-machine receives identification sequence 
"DDDD" correctly, it changes its mode to ISSCII-83 and sends 
"OK" to the called-machine. Now the indicator on the initiating- 
machine lights up continuously. 

H-2.4 If "DDDD" is not received by the initiating-machine for 2
seconds, the indicator goes off, the machine reverts back to the
CCITT mode, and step 1 is repeated. This sequence is repeated
twice in case of the automatic mode.

H-2.5 On receiving "OK" in ISSCII-83, the indicator of the
responding-machine becomes continuously on.

H-2.6 If "OK" is not received by the responding-machine for 2 
seconds, the mode of the machine changes over to CCITT, 
indicator switches off and the sequence 1 is repeated twice in 
case of the automatic mode. 

H-2.7 To change from ISSCII-83 to CCITT, either of the machines
sends the "ऽऽ" (two Avagrah) sequence. The initiating-machine
switches off its indicator, and reverts to the CCITT mode.

H-2.8 On receipt of the change-over sequence "ऽऽ", the
receiving-machine changes over to CCITT mode, switches off its
indicator and sends WRU code back.

H-2.9 The receipt of answer-back in CCITT serves as confirmation
of change over to CCITT mode of the other machine.

H-2.10 In case of answer-back failure in manual or auto-mode,
the call clears down. 
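
The initiating side of steps H-2.1 to H-2.4 can be sketched as a small retry loop. This is only an illustrative sketch, not part of the standard: the function name, the `send` and `receive` callables, and the mode constants are hypothetical, and the default of three attempts is one reading of "repeated twice" (one initial try plus two repeats).

```python
from typing import Callable, Optional

MODE_CCITT = "CCITT"
MODE_ISSCII83 = "ISSCII-83"


def initiate_mode_change(
    send: Callable[[str], None],
    receive: Callable[[float], Optional[str]],
    attempts: int = 3,       # assumption: first try plus two repeats
    timeout_s: float = 2.0,  # H-2.4: wait 2 seconds for "DDDD"
) -> str:
    """Return the mode the initiating machine ends up in."""
    for _ in range(attempts):
        send("HHHH")                # H-2.1: mode-change sequence, indicator flashes
        reply = receive(timeout_s)  # None stands for "nothing received in time"
        if reply == "DDDD":         # H-2.3: identification received correctly
            send("OK")              # acknowledgement, sent in ISSCII-83
            return MODE_ISSCII83    # indicator now lights up continuously
        # H-2.4: indicator goes off, machine is back in CCITT mode, retry
    return MODE_CCITT


if __name__ == "__main__":
    log = []
    replies = iter([None, "DDDD"])  # first attempt times out, second succeeds
    print(initiate_mode_change(log.append, lambda t: next(replies)), log)
```

Passing the link as two callables keeps the sketch independent of any particular telex hardware or timing source.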



H-3 Multilingual Machines 

A multilingual machine always provides all the 10 Brahmi-based
Indian scripts. These scripts can be typed in a common manner
through the Inscript keyboard overlay. In addition there can be 
different overlays for some scripts. 

SCRIPT-MNEMONIC TABLE

Script          Mnemonic

Assamese        AAA
Bengali         BBB
Devanagari      DDD
Gujarati        GGG
Kannada         KKK
Telugu          LLL
Malayalam       MMM
Oriya           OOO
Punjabi         PPP
Tamil           TTT



An initiating machine indicates the default script to a responding
machine through a 3 character Script-Mnemonic, as defined in the
Script-Mnemonic Table.

Although each line starts with the default script, it is possible to
select other scripts within a line through the Attribute character
(ATR) defined in the ISCII code. All the script attributes will,
however, terminate at the end of a line, and the next line will start
with the default script.

The ATR character also allows selection of different display
attributes, like bold, italics and underline. These attributes are
always off at the beginning of a line, and then work on a toggle
basis.
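
The per-line behaviour of script and display attributes described above can be illustrated with a small state holder. This is a sketch under stated assumptions: the class and method names are hypothetical, and the actual ATR byte sequences defined in the ISCII code are not modelled here, only the reset and toggle rules.

```python
from dataclasses import dataclass, field


@dataclass
class LineState:
    """Tracks the script and display attributes in force within one line."""
    default_script: str
    script: str = ""
    display: set = field(default_factory=set)

    def __post_init__(self) -> None:
        # Every line starts in the default script with all attributes off.
        self.script = self.default_script

    def select_script(self, script: str) -> None:
        # Stands in for an ATR script selection inside a line.
        self.script = script

    def toggle(self, attribute: str) -> None:
        # Display attributes work on a toggle basis: on if off, off if on.
        self.display ^= {attribute}

    def new_line(self) -> None:
        # At the end of a line all attributes terminate.
        self.script = self.default_script
        self.display.clear()


if __name__ == "__main__":
    state = LineState("Devanagari")
    state.select_script("Tamil")
    state.toggle("bold")
    print(state.script, state.display)
    state.new_line()
    print(state.script, state.display)
```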

H-4 Multilingual to Multilingual Protocol 

H-4.1 After the call is established with verification of the called 
party identity by WRU exchange, the initiating machine sends 
"ZHHHH" mode-change sequence to the responding machine 
and an indicator flashes. 

H-4.2 On receiving the "ZHHHH" sequence, the responding-machine
sends the "IIII" identification sequence in CCITT code and its
mode changes to ISCII. Its indicator starts flashing.

H-4.3 If the initiating-machine receives identification sequence
"IIII" correctly, it changes its mode to ISCII and sends a default
script mnemonic, as specified in the Script-Mnemonic table.
Now the indicator on the initiating machine lights up continuously.

H-4.4 If "DDDD" is received by the initiating-machine then the
interaction proceeds as defined for two bilingual machines. If
"IIII" is not received for 2 seconds the indicator goes off, the
machine reverts back to the CCITT mode and the sequence 1
is repeated twice in case of the automatic mode.

H-4.5 On receiving a 3 character script-mnemonic in ISCII, the
indicator of the responding-machine becomes continuously on.
As the characters are repeated thrice within the script-mnemonic,
it is possible to detect the default script if one of the
characters is in error.

H-4.6 If a valid script-mnemonic is not received by the
responding-machine for 2 seconds, the mode of the machine
changes over to CCITT, indicator switches off and the
sequence 1 is repeated twice in case of the automatic mode.

H-4.7 To change from ISCII to CCITT, either of the machines
sends "3?l^3ir sequence. The initiating-machine switches off its 
indicator, and reverts to the CCITT mode. 

H-4.8 On receipt of the change-over sequence "3?l^3jr the 
receiving-machine changes over to CCITT mode, switches off 
its indicator and sends WRU code back. 

H-4.9 The receipt of answer-back in CCITT serves as confirmation
of change over to CCITT mode of the other machine.

H-4.10 In case of answer-back failure in manual or auto-mode,
the call clears down.
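
The error tolerance mentioned in H-4.5 can be illustrated with a majority vote over the three received characters. This is a minimal sketch, not a procedure given by the standard: the function and table names are hypothetical, the letter-to-script mapping follows the Script-Mnemonic Table, and Tamil is assumed to use the letter T.

```python
from collections import Counter
from typing import Optional

# One letter per script, as repeated three times in the script-mnemonic.
SCRIPT_BY_LETTER = {
    "A": "Assamese", "B": "Bengali", "D": "Devanagari", "G": "Gujarati",
    "K": "Kannada", "L": "Telugu", "M": "Malayalam", "O": "Oriya",
    "P": "Punjabi", "T": "Tamil",  # assumption: Tamil is TTT
}


def decode_script_mnemonic(mnemonic: str) -> Optional[str]:
    """Return the default script, tolerating one wrong character.

    Returns None when the mnemonic is not 3 characters long, when no
    character occurs at least twice, or when the majority letter is
    not in the table.
    """
    if len(mnemonic) != 3:
        return None
    letter, count = Counter(mnemonic).most_common(1)[0]
    if count < 2:
        return None
    return SCRIPT_BY_LETTER.get(letter)


if __name__ == "__main__":
    for received in ("DDD", "DXD", "ABD"):
        print(received, decode_script_mnemonic(received))
```

A result of None corresponds to the case in H-4.6 where no valid script-mnemonic has been received.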
Telephones : 331 01 31, 331 13 75

Telegrams : Manaksanstha ( Common to all Offices )

Regional Offices :

Central : Manak Bhavan, 9 Bahadur Shah Zafar Marg, NEW DELHI 110002
Telephone : 331 01 31, 331 13 75

Eastern : 1/14 C. I. T. Scheme VII M, V. I. P. Road, Maniktola, CALCUTTA 700054
Telephone : 37 84 99, 37 85 61, 37 86 26, 37 86 62

Northern : SCO 445-446, Sector 35-C, CHANDIGARH 160036
Telephone : 53 38 43, 53 16 40, 53 23 84

Southern : C. I. T. Campus, IV Cross Road, MADRAS 600113
Telephone : 235 02 16, 235 04 42, 235 15 19, 235 23 15

Western : Manakalaya, E9 MIDC, Marol, Andheri ( East ), BOMBAY 400093
Telephone : 632 92 95, 632 78 58, 632 78 91, 632 78 92

Branches : AHMADABAD, BANGALORE, BHOPAL, BHUBANESHWAR, COIMBATORE,
FARIDABAD, GHAZIABAD, GUWAHATI, HYDERABAD, JAIPUR, KANPUR,
LUCKNOW, PATNA, THIRUVANANTHAPURAM.